Fully homomorphic encryption

ABSTRACT

In one exemplary embodiment of the invention, a method and computer program include: receiving first and second ciphertexts having first and second data encrypted per an encryption scheme, the encryption scheme has public/secret keys and encryption, decryption, operation and refresh functions, the encryption function encrypts data, the decryption decrypts ciphertext, the operation receives ciphertexts and performs operation(s) on them, the refresh operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, utilizing a modulus switching technique that involves transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, the technique includes scaling by p/q and rounding, p&lt;q; using the operation function(s), performing operation(s) on them to obtain a third ciphertext; and reducing a noise level of the third ciphertext using the refresh function.

CROSS-REFERENCE TO RELATED APPLICATIONS

This patent application claims priority under 35 U.S.C. §119(e) from Provisional Patent Application No. 61/481,048, filed Apr. 29, 2011, the disclosure of which is incorporated by reference herein in its entirety.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

This invention was made with Government support under contract no. FA8750-11-C-0096 awarded by Defense Advanced Research Projects Agency (DARPA). The Government has certain rights to this invention.

TECHNICAL FIELD

The exemplary embodiments of this invention relate generally to encryption/decryption schemes, algorithms, techniques, methods, computer programs and apparatus and, more specifically, relate to homomorphic encryption schemes, algorithms and apparatus.

BACKGROUND

This section endeavors to supply a context or background for the various exemplary embodiments of the invention as recited in the claims. The content herein may comprise subject matter that could be utilized, but not necessarily matter that has been previously utilized, described or considered. Unless indicated otherwise, the content described herein is not considered prior art, and should not be considered as admitted prior art by inclusion in this section.

Encryption schemes that support operations on encrypted data (aka homomorphic encryption) have a very wide range of applications in cryptography. This concept was introduced by Rivest et al. shortly after the discovery of public key cryptography [21], and many known public-key cryptosystems support either addition or multiplication of encrypted data. However, supporting both at the same time seems harder, and until recently attempts at constructing so-called “fully homomorphic” encryption turned out to be insecure.

BRIEF SUMMARY

In one exemplary embodiment of the invention, a computer-readable storage medium storing program instructions, execution of the program instructions resulting in operations comprising: receiving a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; performing at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and reducing a noise level of the third ciphertext by using the refresh function.

In another exemplary embodiment of the invention, a method comprising: receiving a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; performing at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and reducing a noise level of the third ciphertext by using the refresh function.

In a further exemplary embodiment of the invention, an apparatus comprising: at least one processor configured to receive a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; and at least one memory configured to store the first ciphertext and the second ciphertext, where the at least one processor is further configured to perform at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and to reduce a noise level of the third ciphertext by using the refresh function.

In another exemplary embodiment of the invention, an apparatus comprising: means for receiving a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; means for performing at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and means for reducing a noise level of the third ciphertext by using the refresh function.

In a further exemplary embodiment of the invention, a computer-readable storage medium storing program instructions, execution of the program instructions resulting in operations comprising: receiving a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to enable slow growth of the magnitude of noise for a ciphertext while maintaining the modulus of the ciphertext constant without using the secret key, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; performing at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and reducing a noise level of the third ciphertext by using the refresh function.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

The foregoing and other aspects of embodiments of this invention are made more evident in the following Detailed Description, when read in conjunction with the attached Drawing Figures, wherein:

FIG. 1 illustrates a block diagram of an exemplary system in which various exemplary embodiments of the invention may be implemented;

FIG. 2 illustrates a simple block diagram of a requestor and a server (e.g., devices, apparatus, computer programs, systems), such as a search engine, that use the fully homomorphic encryption scheme constructed from a bootstrappable encryption scheme in accordance with the exemplary embodiments of this invention;

FIG. 3 depicts a logic flow diagram illustrative of the operation of an exemplary method, and the operation of an exemplary computer program, in accordance with the exemplary embodiments of this invention; and

FIG. 4 depicts a logic flow diagram illustrative of the operation of another exemplary method, and the operation of another exemplary computer program, in accordance with the exemplary embodiments of this invention.

DETAILED DESCRIPTION 1 Introduction

1.1 Fully Homomorphic Encryption

A fully homomorphic encryption scheme may be considered as one that allows the computation of arbitrary functions over encrypted data without requiring the use of a decryption key.

There has existed an open problem of constructing a fully homomorphic encryption scheme. This notion, originally called a privacy homomorphism, was introduced by Rivest, Adleman and Dertouzous (R. Rivest, L. Adleman, and M. Dertouzous. On data banks and privacy homomorphisms. In Foundations of Secure Computation, pages 169-180, 1978) shortly after the development of RSA by Rivest, Shamir, and Adleman (R. Rivest, A. Shamir, and L. Adleman. A method for obtaining digital signatures and public-key cryptosystems. In Comm. of the ACM, 21:2, pages 120-126, 1978). Basic RSA is a multiplicatively homomorphic encryption scheme, i.e., given RSA public key pk=(N,e) and ciphertexts {ψ_(i)←π_(i) ^(e) mod N}, one can efficiently compute Π_(i)ψ_(i)=(Π_(i)π_(i))^(e) mod N, a ciphertext that encrypts the product of the original plaintexts. One may assume that it was RSA's multiplicative homomorphism, an accidental but useful property, that led Rivest et al. to ask a natural question: What can one do with an encryption scheme that is fully homomorphic: a scheme ε with an efficient algorithm Evaluate_(ε) that, for any valid public key pk, any circuit C (not just a circuit consisting of multiplication gates as in RSA), and any ciphertexts ψ_(i)←Encrypt_(ε)(pk,π_(i)), outputs ψ←Evaluate_(ε)(pk,C,ψ ₁, . . . , ψ_(t)),

a valid encryption of C(π₁, . . . , π₁) under pk? Their answer: one can arbitrarily compute on encrypted data, i.e., one can process encrypted data (query it, write into it, do anything to it that can be efficiently expressed as a circuit) without the decryption key. As an application, they suggested private data banks. A user can store its data on an untrusted server in encrypted form. Later, the user can send a query on the data to the server, whereupon the server can express this query as a circuit to be applied to the data, and use the Evaluate_(ε) algorithm to construct an encrypted response to the user's query, which the user then decrypts. One would obviously want the server's response here to be more concise than the trivial solution, in which the server just sends all of the encrypted data back to the user to process on its own.

It is known that one can construct additively homomorphic encryption schemes from lattices or linear codes. The lattice-based scheme and the Reed-Solomon-code-based scheme allow multiplications, though with exponential expansion in ciphertext size. Ciphertexts implicitly contain an “error” that grows as ciphertexts are added together. Thus, ciphertexts output by Evaluate do not have the same distribution as ciphertexts output by Encrypt, and at some point the error may become large enough to cause incorrect decryption. For this reason, the homomorphism is sometimes referred to as a “pseudohomomorphism” or a “bounded homomorphism”.

There are schemes that use a singly homomorphic encryption scheme to construct a scheme that can perform more complicated homomorphic operations (T. Sander, A. Young, and M. Yung. Non-interactive cryptocomputing for NC1. In Proc. of FOCS '99, pages 554-567, 1999, and Y. Ishai and A. Paskin. Evaluating Branching Programs on Encrypted Data. In Proc. of TCC '07. Sanders, Young and Yung (SYY) show that one can use a circuit-private additively homomorphic encryption scheme to construct a circuit-private scheme that can handle arbitrary circuits, where the ciphertext size increases exponentially with the depth of the circuit. Their scheme may, therefore, feasibly evaluate NC1 circuits. Ishai and Paskin show how to evaluate branching programs, and with much smaller ciphertexts than SYY. In their scheme Evaluate outputs a ciphertext whose length is proportional to the length of the branching program. This remains true even if the size of the branching program is very large, e.g., super-polynomial. However, the computational complexity of their scheme is proportional to the size.

In more detail, Ishai and Paskin use a “leveled” approach to evaluate a branching program. A (deterministic) branching program (BP) P is defined by a DAG from a distinguished initial node in which each nonterminal node has two outgoing edges labeled 0 and 1, and where the terminal nodes also have labels.

Fully homomorphic encryption (FHE) [21, 8] allows a computationally powerful worker to receive encrypted data and perforin arbitrarily-complex dynamically-chosen computations on that data while it remains encrypted, despite not having the secret decryption key. Until recently, all FHE schemes [8, 6, 22, 10, 5, 4] followed the same blueprint, the one laid out in Gentry's original construction [8, 7].

The first step in Gentry's blueprint is to construct a somewhat homomorphic encryption (SWHE) scheme, namely an encryption scheme capable of evaluating “low-degree” polynomials homomorphically. Starting with Gentry's original construction based on ideal lattices [8], there are by now a number of such schemes in the literature [6, 22, 10, 5, 4, 14], all of which are based on lattices (either directly or implicitly). The ciphertexts in all these schemes are “noisy”, with a noise that grows slightly during homomorphic addition, and explosively during homomorphic multiplication, and hence, the limitation of low-degree polynomials.

To obtain FHE, Gentry provided a remarkable bootstrapping theorem which states that given a SWHE scheme that can evaluate its own decryption function (plus an additional operation), one can transform it into a “leveled” FHE scheme. (In a “leveled” FHE scheme, the parameters of the scheme may depend on the depth of the circuits that the scheme can evaluate (but not on their size). One can obtain a “pure” FHE scheme (with a constant-size public key) from a leveled FHE scheme by assuming “circular security”—namely, that it is safe to encrypt the leveled FHE secret key under its own public key. We will often omit the term “leveled” in this work.) Bootstrapping “refreshes” a ciphertext by running the decryption function on it homomorphically, using an encrypted secret key (given in the public key or obtainable therefrom), resulting in reduced noise (a reduction of noise generated by the operations).

Until recently, SWHE schemes tended to be incapable of evaluating their own decryption circuits (plus some) without significant modifications. (We discuss recent exceptions [9, 3] below.) Thus, the final step is to squash the decryption circuit of the SWHE scheme, namely transform the scheme into one with the same homomorphic capacity but a decryption circuit that is simple enough to allow bootstrapping. Gentry [8] showed how to do this by adding a “hint”—namely, a large set with a secret sparse subset that sums to the original secret key—to the public key and relying on a “sparse subset sum” assumption.

A bootstrappable encryption scheme is one wherein the encryption scheme can evaluate its own decryption circuit (e.g., slightly augmented versions of its own decryption circuit). Gentry showed that if the decryption circuit of a SWHE scheme is shallow enough, in particular, if it is shallow enough to be evaluated homomorphically by the somewhat homomorphic scheme itself (a self-referential property), then this somewhat homomorphic scheme becomes “bootstrappable”, and can be used to construct a fully homomorphic scheme that can evaluate circuits of arbitrary depth.

It may be useful to provide a physical analogy as an aid in visualizing the concept of fully homomorphic encryption. Assume that the owner of a jewelry store wants her employees to assemble raw precious materials (diamonds, gold, etc.) into finished products, but is worried about theft. The owner addresses the problem by constructing glove boxes for which only the owner has the key (analogous to the secret key in an encryption scheme), and puts the raw materials inside the glove boxes (analogous to an encryption operation). Using the gloves, an employee can manipulate the items inside the box. Moreover, an employee can put things inside the box, e.g., a soldering iron to use on the raw materials, although the employee cannot take anything out. Also, the box is transparent, so that an employee can see what he is doing within the box. In this analogy, encryption means that the employee is unable to take something out of the box, not that he is unable to see it. After the employee is finished, the jewelry store owner can recover the finished product at her leisure by using her key. This analogy is inadequate in the sense that the glove box might become quite cluttered, whereas in the fully homomorphic encryption scheme only the final product need remain. In other words, to improve the analogy, imagine that the employee has some way to make any item in the glove box (of his choosing) disappear, even though he still cannot extract the item.

Now imagine that the glove boxes are defective; after an employee uses the gloves for one minute, the gloves stiffen and become unusable (analogous to the accumulation of noise). Unfortunately, even the fastest employee cannot assemble some of the more intricate designs in under a minute. To solve this problem the jewelry store owner gives to an employee that is assembling an intricate design a glove box containing the raw materials, but also several additional glove boxes. Each of these additional glove boxes holds a copy of the master key. To assemble the intricate design, the employee manipulates the materials in box #1 until the gloves stiffen. Then, he places box #1 inside box #2, where the latter box already contains a master key. Using the gloves for box #2, he opens box #1 with the master key, extracts the partially assembled item, and continues the assembly within box #2 until its gloves stiffen. He then places box #2 inside box #3, and so on. The employee finally finishes his assembly inside of box #n. Of course, this procedure assumes that the employee can open box #1 within box #(i+1), and have time to some progress on the assembly, all before the gloves of box #(i+1) stiffen. This is analogous to the requirement for a bootstrappable encryption scheme ε, that the complexity of ε's (augmented) decryption circuit is less than what ε can homomorphically evaluate.

The foregoing analogy assumes that it is safe to use a single master key that opens all boxes. However, perhaps an employee could use the gloves for box #2, together with master key inside that box, to open the box from the inside, extract the key, and use it to open box #1 and remove the jewels. However, this situation can be avoided by using distinct keys for the boxes, and placing the key for box #1 inside box #2, the key for box #2 inside box #3, and so on. This is analogous to the question of whether the encryption scheme is KDM-secure.

One non-limiting application of fully homomorphic encryption is in a two-party setting. A simple example is making encrypted queries to search engines. Referring to FIG. 2, to perform an encrypted search a party (requestor 1) generates a public key pk for the fully homomorphic encryption scheme, and generates ciphertexts ψ₁, . . . , ψ_(t) that encrypt the query π₁, . . . , π_(t) under pk. (For example, each π_(i) could be a single bit of the query.) Now, let the circuit C express a search engine server 2 search function for data stored in storage 3. The server 2 sets ψ_(i)*←Evaluate(pk, C_(i), ψ₁, . . . , ψ_(t)), where C_(i) is the sub-circuit of C that computes the ith bit of the output. Note that, in practice, the evaluation of and C_(i)* and C_(j)* may share intermediate results, in which case it may be needlessly inefficient to run independent instances of the Evaluate algorithm. The server 2 sends these ciphertexts to the requestor 1. It is known that, by the correctness requirement, Dectypt(sk,ψ_(i)*)=C_(i)(π₁, . . . , π_(t)). These latter values constitute precisely the answer to the query, which is recoverable through decryption.

As another non-limiting application, the exemplary embodiments of this invention enable searching over encrypted data. In this scenario, assume that the requestor 1 stores files on the server 2 (e.g., on the Internet), so that the requestor 1 can conveniently access these files without needing the requestor's computer. However, the requestor encrypts the files, otherwise the server 2 could potentially read the private data. Let bits π₁, . . . , π_(t) represent the files, which are encrypted in the ciphertexts ψ₁, . . . , ψ_(t). Assume then that the requestor 1 later wants to download all encrypted files that satisfy a query, e.g., all files containing the word ‘homomorphic’ within 5 words of ‘encryption’, but not the word ‘evoting’. The requestor 1 sends the query to the server 2, which expresses it as a circuit C. The server sets ψ_(i)* ←Evaluate(pk, C_(i), ψ₁, . . . , ψ_(t)) and sends these ciphertexts to the requestor 1. who decrypts the returned ciphertexts to recover C(π₁, . . . , π_(t)), the (bits of the) files that satisfy the query.

Note that in this application, as in the encrypted search application, the requestor preferably provides an upper bound on the number of bits that the response should have, and the encrypted response from the server 2 is padded or truncated to meet the upper bound.

Fully homomorphic encryption has numerous applications. For example, it enables private search engine queries where the search engine responds to a query without knowledge of the query, i.e., a search engine can provide a succinct encrypted answer to an encrypted (Boolean) query without knowing what the query was. It also enables searching on encrypted data; one can store encrypted data on a remote server and later have the server retrieve only files that (when decrypted) satisfy some Boolean constraint, even though the server cannot decrypt the files on its own. More broadly, fully homomorphic encryption improves the efficiency of secure multiparty computation.

1.2 Efficiency of FHE

The efficiency of fully homomorphic encryption has been a (perhaps, the) big question following its invention. In this paper, we are concerned with the per-gate computation overhead of the FHE scheme, defined as the ratio between the time it takes to compute a circuit homomorphically to the time it takes to compute it in the clear. (Other measures of efficiency, such ciphertext/key size and encryption/decryption time, are also important. In fact, the schemes we present in this paper are very efficient in these aspects (as are the schemes in [9, 3]).) Unfortunately, FHE schemes that follow Gentry's blueprint (some of which have actually been implemented [10, 5]) have fairly poor performance—their per-gate computation overhead is p(λ), a large polynomial in the security parameter. In fact, we would like to argue that this penalty in performance is somewhat inherent for schemes that follow this blueprint.

First, the complexity of (known approaches to) bootstrapping is inherently at least the complexity of decryption times the bit-length of the individual ciphertexts that are used to encrypt the bits of the secret key. The reason is that bootstrapping involves evaluating the decryption circuit homomorphically—that is, in the decryption circuit, each secret-key bit is replaced by a (large) ciphertext that encrypts that bit—and both the complexity of decryption and the ciphertext lengths must each be Ω(λ).

Second, the undesirable properties of known SWHE schemes conspire to ensure that the real cost of bootstrapping for FHE schemes that follow this blueprint is actually much worse than quadratic. Known FHE schemes start with a SWHE scheme that can evaluate polynomials of degree D (multiplicative depth log D) securely only if the underlying lattice problem is hard to 2^(D)-approximate. To achieve hardness against 2^(λ) time adversaries, the lattice must have dimension Ω(D·λ). This is because we have lattice algorithms in n dimensions that compute 2^(n/λ)-approximations of short vectors in time

. Moreover, the coefficients of the vectors used in the scheme have bit length Ω(D) to allow the ciphertext noise room to expand to 2^(D). Therefore, the size of “fresh” ciphertexts (e.g., those that encrypt the bits of the secret key) is {tilde over (Ω)}(D²·λ). Since the SWHE scheme must be “bootstrappable”—i.e., capable of evaluating its own decryption function—D must exceed the degree of the decryption function. Typically, the degree of the decryption function is Ω(λ). Thus, overall, “fresh” ciphertexts have size {tilde over (Ω)}(λ³). So, the real cost of bootstrapping—even if we optimistically assume that the “stale” ciphertext that needs to be refreshed can be decrypted in only Θ(λ)-time—is {tilde over (Ω)}(λ⁴).

The analysis above ignores a nice optimization by Stehlé and Steinfeld [24], which so far has not been useful in practice, that uses Chernoff bounds to asymptotically reduce the decryption degree down to O(√{square root over (λ)}). With this optimization, the per-gate computation of FHE schemes that follow the blueprint is {tilde over (Ω)}(λ³). (We note that bootstrapping lazily—i.e., applying the refresh procedure only at a 1/L fraction of the circuit levels for L>1—cannot reduce the per-gate computation further by more than a logarithmic factor for schemes that follow this blueprint, since these SWHE schemes can evaluate only log multiplicative depth before it becomes absolutely necessary to refresh—i.e., L=O(log λ).)

1.3 Recent Deviations from Gentry's Blueprint, and the Hope for Better Efficiency

Recently, Gentry and Halevi [9], and Brakerski and Vaikuntanathan [3], independently found very different ways to construct FHE without using the squashing step, and thus without the sparse subset sum assumption. These schemes are the first major deviations from Gentry's blueprint for FHE. Surprisingly, Brakerski and Vaikuntanathan [3] showed how to base security entirely on LWE (for sub-exponential approximation factors), avoiding reliance on ideal lattices.

From an efficiency perspective, however, these results are not a clear win over previous schemes. Both of the schemes still rely on the problematic aspects of Gentry's blueprint—namely, bootstrapping and an SWHE scheme with the undesirable properties discussed above. Thus, their per-gate computation is still more than {tilde over (Ω)}(λ⁴). Nevertheless, the techniques introduced in these recent constructions are very interesting and useful to us. In particular, we use the tools and techniques introduced by Brakerski and Vaikuntanathan [3] in an essential way to achieve remarkable efficiency gains.

An important, somewhat orthogonal question is the strength of assumptions underlying FHE schemes. All the schemes so far rely on the hardness of short vector problems on lattices with a subexponential approximation factor. Can we base FHE on the hardness of finding a polynomial approximation?

1.4 Our Results and Techniques

We leverage Brakerski and Vaikuntanathan's techniques [3] to achieve asymptotically very efficient FHE schemes. Also, we base security on lattice problems with quasi-polynomial approximation factors. (All previous schemes relied on the hardness of problems with sub-exponential approximation factors.) In particular, we have the following theorem (informal):

-   -   Assuming Ring LWE for an approximation factor exponential in L,         we have a leveled FHE scheme that can evaluate L-level         arithmetic circuits without using bootstrapping. The scheme has         {tilde over (Ω)}(λ·L³) per-gate computation (namely,         quasi-linear in the security parameter).     -   Alternatively, assuming Ring LWE is hard for quasi-polynomial         factors, we have a leveled FHE scheme that uses bootstrapping as         an optimization, where the per-gate computation (which includes         the bootstrapping procedure) is {tilde over (Ω)}(λ²),         independent of L.

We can alternatively base security on LWE, albeit with worse performance. We now sketch our main idea for boosting efficiency.

In the BV scheme [3], like ours, a ciphertext vector cεR^(n) (where R is a ring, and n is the “dimension” of the vector) that encrypts a message m satisfies the decryption formula m=[[

c,s

]_(q)]₂, where sεR^(n) is the secret key vector, q is an odd modulus, and [·]_(q), denotes reduction into the range (−q/2,q/2). This is an abstract scheme that can be instantiated with either LWE or Ring LWE—in the LWE instantiation, R is the ring of integers mod q and n is a large dimension, whereas in the Ring LWE instantiation, R is the ring of polynomials over integers mod q and an irreducible ƒ(x), and the dimension n=2.

We will call [

c,s

]_(q) the noise associated to ciphertext c under key s. Decryption succeeds as long as the magnitude of the noise stays smaller than q/2. Homomorphic addition and multiplication increase the noise in the ciphertext. Addition of two ciphertexts with noise at most B results in a ciphertext with noise at most 2B, whereas multiplication results in a noise as large as B². (The noise after multiplication is in fact a bit larger than B² due to the additional noise from the BV “re-linearization” process. For the purposes of this exposition, it is best to ignore this minor detail.) We will describe a noise-management technique that keeps the noise in check by reducing it after homomorphic operations, without bootstrapping.

The key technical tool we use for noise management is the “modulus switching” technique developed by Brakerski and Vaikuntanathan [3]. Jumping ahead, we note that while they use modulus switching in “one shot” to obtain a small ciphertext (to which they then apply Gentry's bootstrapping procedure), we will use it (iteratively, gradually) to keep the noise level essentially constant, while stingily sacrificing modulus size and gradually sacrificing the remaining homomorphic capacity of the scheme.

1.5 Modulus Switching

The essence of the modulus-switching technique is captured in the following lemma. In words, the lemma says that an evaluator, who does not know the secret key s but instead only knows a bound on its length, can transform a ciphertext c modulo q into a different ciphertext modulo p while preserving correctness—namely, [

c′,s

]_(p)=[

c,s

], mod 2. The transformation from c to c′ involves simply scaling by (p/q) and rounding appropriately! Most interestingly, if s is short and p is sufficiently smaller than q, the “noise” in the ciphertext actually decreases—namely, |[

c′,s

]_(p)|<|[

c,s

]_(q)|.

Lemma 1

Let p and q be two odd moduli, and let c be an integer vector. Define c′ to be the integer vector closest to (p/q)·c such that c′=c mod 2. Then, for any s with |[

c,s

]_(q)|<q/2−(g/p)·l₁(s), we have [

c′,s

] _(p) =[

c,s

] _(q) mod 2 and |[

c′,s

] _(p)|<(p/q)·|[

c,s

] _(q) |+l ₁(s) where l₁(s) is the l₁-norm of s.

Proof.

For some integer k, we have [

c,s

]_(q)=

c,s

−kq. For the same k, let e_(p)=

c′,s

−kpεZ. Since c′=c and p=q modulo 2, we have e_(p)=[

c′,s

]_(q) mod 2. Therefore, to prove the lemma, it suffices to prove that e_(p)=[

c′,s

], and that it has small enough norm. We have e_(p)=(p/q)[

c,s

]_(q)+

c′−(p/q)c,s

, and therefore |e_(p)|≦(p/q)[

c,s

]_(q)+l₁(s)<p/2. The latter inequality implies e_(p)=[

(c′,s

]_(p).

Amazingly, this trick permits the evaluator to reduce the magnitude of the noise without knowing the secret key, and without bootstrapping. In other words, modulus switching gives us a very powerful and lightweight way to manage the noise in FHE schemes! In [3], the modulus switching technique is bundled into a “dimension reduction” procedure, and we believe it deserves a separate name and close scrutiny. It is also worth noting that our use of modulus switching does not require an “evaluation key”, in contrast to [3].

1.6 Our New Noise Management Technique

At first, it may look like modulus switching is not a very effective noise management tool. If p is smaller than q, then of course modulus switching may reduce the magnitude of the noise, but it reduces the modulus size by essentially the same amount. In short, the ratio of the noise to the “noise ceiling” (the modulus size) does not decrease at all. Isn't this ratio what dictates the remaining homomorphic capacity of the scheme, and how can potentially worsening (certainly not improving) this ratio do anything useful?

In fact, it's not just the ratio of the noise to the “noise ceiling” that's important. The absolute magnitude of the noise is also important, especially in multiplications. Suppose that q≈x^(k), and that you have two mod-q SWHE ciphertexts with noise of magnitude x. If you multiply them, the noise becomes x². After 4 levels of multiplication, the noise is x¹⁶. If you do another multiplication at this point, you reduce the ratio of the noise ceiling (i.e. q) to the noise level by a huge factor of x¹⁶—i.e., you reduce this gap very fast. Thus, the actual magnitude of the noise impacts how fast this gap is reduced. After only log k levels of multiplication, the noise level reaches the ceiling.

Now, consider the following alternative approach. Choose a ladder of gradually decreasing moduli {q_(i)≈q/x^(i)} for i<k. After you multiply the two mod-q ciphertexts, switch the ciphertext to the smaller modulus q₁=q/x. As the lemma above shows, the noise level of the new ciphertext (now with respect to the modulus q₁) goes from x² back down to x. (Let's suppose for now that l₁(s) is small in comparison to x so that we can ignore it.) Now, when we multiply two ciphertexts (wrt modulus q₁) that have noise level x, the noise again becomes x², but then we switch to modulus q₂ to reduce the noise back to x. In short, each level of multiplication only reduces the ratio (noise ceiling)/(noise level) by a factor of x (not something like x¹⁶). With this new approach, we can perform about k (not just log k) levels of multiplication before we reach the noise ceiling. We have just increased (without bootstrapping) the number of multiplicative levels that we can evaluate by an exponential factor!

This exponential improvement is enough to achieve leveled FHE without bootstrapping. For any polynomial L, we can evaluate circuits of depth L. The performance of the scheme degrades with L—e.g., we need to set q=q₀ to have bit length proportional to L—but it degrades only polynomially with L.

Our main observation—the key to obtaining FHE without bootstrapping—is so simple that it is easy to miss and bears repeating: We get noise reduction automatically via modulus switching, and by carefully calibrating our ladder of moduli {q₁}, one modulus for each circuit level, to be decreasing gradually, we can keep the noise level very small and essentially constant from one level to the next while only gradually sacrificing the size of our modulus until the ladder is used up. With this approach, we can efficiently evaluate arbitrary polynomial-size arithmetic circuits without resorting to bootstrapping.

In terms of performance, this scheme trounces previous FHE schemes (at least asymptotically; the concrete performance remains to be seen). Instantiated with ring-LWE, it can evaluate L-level arithmetic circuits with per-gate computation Õ(λ·L³)—i.e., computation quasi-linear in the security parameter. Since the ratio of the largest modulus (namely, q≈x^(L)) to the noise (namely, x) is exponential in L, the scheme relies on the hardness of approximating short vectors to within an exponential in L factor.

1.7 Bootstrapping for Better Efficiency and Better Assumptions

In our FHE-without-bootstrapping scheme, the per-gate computation depends polynomially on the number of levels in the circuit that is being evaluated. While this approach is efficient (in the sense of “polynomial time”) for polynomial-size circuits, the per-gate computation may become undesirably high for very deep circuits. So, we re-introduce bootstrapping as an optimization that makes the per-gate computation independent of the circuit depth, and that (if one is willing to assume circular security) allows homomorphic operations to be performed indefinitely without needing to specify in advance a bound on the number of circuit levels.

We are aware of the seeming irony of trumpeting “FHE without bootstrapping” and then proposing bootstrapping “as an optimization”. But, first, FHE without bootstrapping is exciting theoretically, independent of performance. Second, whether bootstrapping actually improves performance depends crucially on the number of levels in the circuit one is evaluating. For example, for circuits of depth sub-polynomial in the security parameter, it will be more efficient asymptotically to forgo the bootstrapping optimization.

The main idea is that to compute arbitrary polynomial-depth circuits, it is enough to compute the decryption circuit of the scheme homomorphically. Since the decryption circuit has depth≈log λ, the largest modulus we need has only polylog(λ) bits, and therefore we can base security on the hardness of lattice problems with quasi-polynomial factors. Since the decryption circuit has size Õ(λ) for the RLWE-based instantiation, the per-gate computation becomes Õ(λ²) (independent of L). See Section 5 for details.

1.8 Other Optimizations

We also consider batching as an optimization. The idea behind batching is to pack multiple plaintexts into each ciphertext so that a function can be homomorphically evaluated on multiple inputs with approximately the same efficiency as homomorphically evaluating it on one input.

An especially interesting case is batching the decryption function so that multiple ciphertexts—e.g., all of the ciphertexts associated to gates at some level in the circuit—can be bootstrapped simultaneously very efficiently. For circuits of large width (say, width λ), batched bootstrapping reduces the per-gate computation in the RLWE-based instantiation to Õ(λ), independent of L. We give the details in Section 5.

1.9 Other Related Work

We note that prior to Gentry's construction, there were already a few interesting homomorphic encryptions schemes that could be called “somewhat homomorphic”, including Boneh-Goh-Nissim [2] (evaluates quadratic formulas using bilinear maps), (Aguilar Melchor)-Gaborit-Herranz [16] (evaluates constant degree polynomials using lattices) and Ishai-Paskin [13] (evaluates branching programs).

1.10 Summary

We present a novel approach to fully homomorphic encryption (FHE) that dramatically improves performance and bases security on weaker assumptions. A central conceptual contribution in our work is a new way of constructing leveled fully homomorphic encryption schemes (capable of evaluating arbitrary polynomial-size circuits), without Gentry's bootstrapping procedure.

Specifically, we offer a choice of FHE schemes based on the learning with error (LWE) or ring-LWE (RLWE) problems that have 2^(λ) security against known attacks. For RLWE, we have:

-   -   A leveled FHE scheme that can evaluate L-level arithmetic         circuits with Õ(λ·L³) per-gate computation—i.e., computation         quasi-linear in the security parameter. Security is based on         RLWE for an approximation factor exponential in L. This         construction does not use the bootstrapping procedure.     -   A leveled FHE scheme that uses bootstrapping as an optimization,         where the per-gate computation (which includes the bootstrapping         procedure) is Õ(λ²), independent of L. Security is based on the         hardness of RLWE for quasi-polynomial factors (as opposed to the         sub-exponential factors needed in previous schemes).

We obtain similar results to the above for LWE, but with worse performance.

Based on the Ring LWE assumption, we introduce a number of further optimizations to our schemes. As an example, for circuits of large width—e.g., where a constant fraction of levels have width at least λ—we can reduce the per-gate computation of the bootstrapped version to Õ(λ), independent of L, by batching the bootstrapping operation. Previous FHE schemes all required {tilde over (Ω)}(λ^(3.5)) computation per gate.

At the core of our construction is a much more effective approach for managing the noise level of lattice-based ciphertexts as homomorphic operations are performed, using some new techniques recently introduced by Brakerski and Vaikuntanathan (FOCS 2011).

2 Preliminaries

2.1 Basic Notation

In our construction, we will use a ring R. In our concrete instantiations, we prefer to use either R=Z (the integers) or the polynomial ring R=Z[x]/(x^(d)+1), where d is a power of 2.

We write elements of R in lowercase—e.g., rεR. We write vectors in bold—e.g., vεR^(n). The notation v[i] refers to the i-th coefficient of v. We write the dot product of u, vεR^(n) as

$\left\langle {u,v} \right\rangle = {{\sum\limits_{i = 1}^{n}\;{{u\lbrack i\rbrack} \cdot {v\lbrack i\rbrack}}} \in {R.}}$ When R is a polynomial ring, ∥r∥ for rεR refers to the Euclidean norm of r's coefficient vector. We say γ_(R)=max{∥a·b∥/∥a∥∥b∥:a,bεR} is the expansion factor of R. For R=Z[x]/(x^(d)+1), the value of γ_(R) is at most √{square root over (d)} by Cauchy-Schwarz. (The canonical embedding [15] provides a better, tighter way of handling the geometry of cyclotomic rings. We instead use the expansion factor, defined above, for its simplicity, and since it suffices for our asymtotic results.)

For integer q, we use R_(q) to denote R/qR. Sometimes we will use abuse notation and use R₂ to denote the set of R-elements with binary coefficients—e.g., when R=Z, R₂ may denote {0,1}, and when R is a polynomial ring, R₂ may denote those polynomials that have 0/1 coefficients. We use R_(q,d) when we also want to specify the degree of the polynomial associated to R. When it is obvious that q is not a power of two, we will use ┌log q┐ to denote 1+└log q┘. For aεR, we use the notation [a]_(q) to refer to a mod q, with coefficients reduced into the range (−q/2, q/2].

2.2 Leveled Fully Homomorphic Encryption

Most of this application will focus on the construction of a leveled fully homomorphic scheme, in the sense that the parameters of the scheme depend (polynomially) on the depth of the circuits that the scheme is capable of evaluating.

Definition 1 (Leveled FHE [7])

We say that a family of homomorphic encryption schemes {E^((L)): LεZ⁺} is leveled fully homomorphic if, for all LεZ⁺, they all use the same decryption circuit, E^((L)) compactly evaluates all circuits of depth at most L (that use some specified complete set of gates), and the computational complexity of E^((L))'s algorithms is polynomial (the same polynomial for all L) in the security parameter, L, and (in the case of the evaluation algorithm) the size of the circuit.

2.3 The Learning with Errors (LWE) Problem

The learning with errors (LWE) problem was introduced by Regev [19]. It is defined as follows.

Definition 2 (LWE)

For security parameter λ, let n=n(λ) be an integer dimension, let q=q(λ)≧2 be an integer, and let χ=χ(λ) be a distribution over Z. The LWE_(n,q,χ) problem is to distinguish the following two distributions: In the first distribution, one samples (a_(i),b_(i)) uniformly from Z_(q) ^(n+1). In the second distribution, one first draws s←Z_(q) ^(n) uniformly and then samples (a_(i),b_(i))εZ_(q) ^(n+1) by sampling a_(i)←Z_(q) ^(n) uniformly, e_(i)←χ, and setting b_(i)=

a,s

+e_(i). The LWE_(n,q,χ) assumption is that the LWE_(n,q,χ) problem is infeasible.

Regev [19] proved that for certain moduli q and Gaussian error distributions χ, the LWE_(n,q,χ) assumption is true as long as certain worst-case lattice problems are hard to solve using a quantum algorithm. We state this result using the terminology of B-bounded distributions, which is a distribution over the integers where the magnitude of a sample is bounded with high probability. A definition follows.

Definition 3 (B-Bounded Distributions)

A distribution ensemble {χ_(n)}_(nεN), supported over the integers, is called B-bounded if

${\Pr\limits_{e\leftarrow\chi_{n}}\left\lbrack {{e} > B} \right\rbrack} = {{{negl}(n)}.}$

We can now state Regev's worst-case to average-case reduction for LWE.

Theorem 1 (Regev)

For any integer dimension n, prime integer q=q(n), and B=B(n)≧2n, there is an efficiently samplable-bounded distribution χ (such that if there exists an efficient (possibly quantum) algorithm that solves LWE_(n,q,χ), then there is an efficient quantum algorithm for solving Õ(qn^(1.5)/B)-approximate worst-case SIVP and gapSVP.

Peikert [18] de-quantized Regev's results to some extent—that is, he showed the LWE_(n,q,χ) assumption is true as long as certain worst-case lattice problems are hard to solve using a classical algorithm. (See [18] for a precise statement of these results.)

Applebaum et al. [1] showed that if LWE is hard for the above distribution of s, then it is also hard when s's coefficients are sampled according to the noise distribution χ.

2.4 The Ring Learning with Errors (RLWE) Problem

The ring learning with errors (RLWE) problem was introduced by Lyubaskevsky, Peikert and Regev [15]. We will use a simplified special-case version of the problem that is easier to work with [20, 4].

Definition 4 (RLWE)

For security parameter λ, let ƒ(x)=x^(d)+1 where d=d(λ) is a power of 2. Let q=q(λ)≧2 be an integer. Let R=Z[x]/(ƒ(x)) and let R_(q)=R/qR. Let χ=χ(λ) be a distribution over R. The RLWE_(d,q,χ) problem is to distinguish the following two distributions: In the first distribution, one samples (a_(i),b_(i)) uniformly from R_(q) ². In the second distribution, one first draws s←R_(q) uniformly and then samples (a_(i),b_(i))εR_(q) ² by sampling a_(i)←R_(q) uniformly, e_(i)←χ, and setting b_(i)=a_(i)·s+e_(i). The RLWE_(d,q,χ) assumption is that the RLWE_(d,q,χ) problem is infeasible.

The RLWE problem is useful, because the well-established shortest vector problem (SVP) over ideal lattices can be reduced to it, specifically:

Theorem 2 (Lyubashevsky-Peikert-Regev [15])

For any d that is a power of 2, ring R=Z[x]/(x^(d)+1), prime integer q=q(d)=1 mod d, and B=ω(√{square root over (d log d)}), there is an efficiently samplable distribution χ that outputs elements of R of length at most B with overwhelming probability, such that if there exists an efficient algorithm that solves RLWE_(d,q,χ) then there is an efficient quantum algorithm for solving d^(ω(1))·(q/B)−approximate worst-case SVP for ideal lattices over R.

Typically, to use RLWE with a cryptosystem, one chooses the noise distribution χ according to a Gaussian distribution, where vectors sampled according to this distribution have length only poly(d) with overwhelming probability. This Gaussian distribution may need to be “ellipsoidal” for certain reductions to go through [15]. It has been shown for RLWE that one can equivalently assume that s is alternatively sampled from the noise distribution χ [15].

2.5 The General Learning with Errors (GLWE) Problem

The learning with errors (LWE) problem and the ring learning with errors (RLWE) problem are syntactically identical, aside from using different rings (Z versus a polynomial ring) and different vector dimensions over those rings (n=poly(λ) for LWE, but n is constant—namely, 1—in the RLWE case). To simplify our presentation, we define a “General Learning with Errors (GLWE)” Problem, and describe a single “GLWE-based” FHE scheme, rather than presenting essentially the same scheme twice, once for each of our two concrete instantiations.

Definition 5 (GLWE)

For security parameter λ, let n=n(λ) be an integer dimension, let ƒ(x)=x^(d)+1 where d=d(λ) is a power of 2, let q=q(λ)≧2 be a prime integer, let R=Z[x]/(ƒ(x)) and R_(q)=R/qR, and let χ=χ(λ) be a distribution over R. The GLWE_(n,f,q,χ) problem is to distinguish the following two distributions: In the first distribution, one samples (a_(i),b_(i)) uniformly from R_(q) ^(n+1). In the second distribution, one first draws s←R_(q) ^(n) uniformly and then samples (a_(i),b_(i))εR_(q) ^(n+1) by sampling a_(i)←R_(q) ^(n) uniformly, e_(i)←χ, and setting b_(i)=

a_(i),s

+_(i). The GLWE_(n,f,q,χ) assumption is that the GLWE_(n,f,q,χ) problem is infeasible.

LWE is simply GLWE instantiated with d=1. RLWE is GLWE instantiated with n=1. Interestingly, as far as we know, instances of GLWE between these extremes have not been explored. One would suspect that GLWE is hard for any (n,d) such that n·d=Ω(λlog(q/B)), where B is a bound (with overwhelming probability) on the length of elements output by χ. For fixed n·d, perhaps GLWE gradually becomes harder as n increases (if it is true that general lattice problems are harder than ideal lattice problems), whereas increasing d is probably often preferable for efficiency.

If q is much larger than B, the associated GLWE problem is believed to be easier (i.e., there is less security). Previous FHE schemes required q/B to be sub-exponential in n or d to give room for the noise to grow as homomorphic operations (especially multiplication) are performed. In our FHE scheme without bootstrapping, q/B will be exponential in the number of circuit levels to be evaluated. However, since the decryption circuit can be evaluated in logarithmic depth, the bootstrapped version of our scheme will only need q/B to be quasi-polynomial, and we thus base security on lattice problems for quasi-polynomial approximation factors.

By the GLWE assumption, the distribution {(a_(i),

a_(i),s

+t·e_(i))} is computational indistinguishable from uniform for any t relatively prime to q. This fact will be convenient for encryption, where, for example, a message m may be encrypted as (a,

a,s

+2e+m), and this fact can be used to argue that the second component of this message is indistinguishable from random.

3 (Leveled) FHE without Bootstrapping: Our Construction

The plan of this section is to present our leveled FHE-without-bootstrapping construction in modular steps. First, we describe a plain GLWE-based encryption scheme with no homomorphic operations. Next, we describe variants of the “relinearization” and “dimension reduction” techniques of [3]. Finally, in Section 3.4, we lay out our construction of FHE without bootstrapping.

3.1 Basic Encryption Scheme

We begin by presenting a basic GLWE-based encryption scheme with no homomorphic operations. Let λ be the security parameter, representing 2^(λ) security against known attacks (λ=100 is a reasonable value).

Let R=R(λ) be a ring. For example, one may use R=Z if one wants a scheme based on (standard) LWE, or one may use R=Z[x]/ƒ(x) where (e.g.) ƒ(x)=x^(d)+1 and d=d(λ) is a power of 2 if one wants a scheme based on RLWE. Let the “dimension” n=n(λ), an odd modulus q=q(λ), and a “noise” distribution χ=χ(λ) over R be additional parameters of the system. These parameters come from the GLWE assumption. For simplicity, assume for now that the plaintext space is R₂=R/2R, though larger plaintext spaces are certainly possible.

We go ahead and stipulate here—even though it only becomes important when we introduce homomorphic operations—that the noise distribution χ is set to be as small as possible. Specifically, to base security on LWE or GLWE, one must use (typically Gaussian) noise distributions with deviation at least some sub-linear function of d or n, and we will let χ be a noise distribution that barely satisfies that requirement. To achieve 2^(λ) security against known lattice attacks, one must have n·d=Ω(λ·log(q/B)) where B is a bound on the length of the noise. Since n or d depends logarithmically on q, and since the distribution χ (and hence B) depends sub-linearly on n or d, the distribution χ (and hence B) depends sub-logarithmically on q. This dependence is weak, and one should think of the noise distribution as being essentially independent of q.

Here is a basic GLWE-based encryption scheme with no homomorphic operations. It uses the plaintext space R₂, but it is easy to generalize it to plaintext spaces R_(p), p>2. It uses an integer parameter N=n·polylog(q) that we will discuss in detail following the description of the scheme.

Basic GLWE-Based Encryption Scheme:

-   -   E.Setup(1^(λ),1^(μ),b): Use the bit bε{0,1} to determine whether         we are setting parameters for a LWE-based scheme (where d=1) or         a RLWE-based scheme (where n=1). Choose a μ-bit modulus q and         choose the parameters d=d(λ,μ,b), n=n(λ,μ,b), and χ=χ(λ,μ,b)         appropriately to ensure that the scheme is based on a GLWE         instance that achieves 2^(λ) security against known attacks. Let         R=Z[x]/(x^(d)+1) and let params=(q,d,n,N,χ).     -   E.SecretKeyGen(params): Sample s′←χ^(n). Set sk=s←(1,s′[1], . .         . , s′[n])εR_(q) ^(n+1).     -   E.PublicKeyGen(params,sk): Takes as input a secret key         sk=s=(1,s′) with s[0]=1 and s′εR_(q) ^(n) and the params.         Generate matrix A′←R_(q) ^(N×x) uniformly and a vector e←χ^(N)         and set b←A′s′+2e. Set A to be the (n+1)-column matrix         consisting of b followed by the n columns of −A′. (Observe:         A·s=2e.) Set the public key pk=A.     -   E.Enc(params,pk,m): To encrypt a message mεR₂ set m←(m, 0, . . .         , 0)εR_(q) ^(n+1), sample r←R₂ ^(N) and output the ciphertext         c←m+A^(T)rεR_(q) ^(n+1).     -   E.Dec(params,sk,c): Output m←[[         c,s         ]_(q)]₂.

Correctness is easy to see, and it is straightforward to base security on special cases (depending on the parameters) of the GLWE assumption (and one can find such proofs of special cases in prior work). To sketch the main ideas, first note that if an attacker can distinguish the public key A from a uniformly random matrix over R_(q) ^(N×(n+1)), then the attacker can be used to solve the GLWE problem (for specific parameters). Therefore, assuming the GLWE problem is hard, an attacker cannot efficiently distinguish. Second, if A was indeed chosen uniformly from R_(q) ^(N×(n+1)), the encryption procedure generates ciphertexts that are statistically independent from m (by the leftover hash lemma), and therefore the attacker has negligible advantage in guessing m.

For the LWE case, it suffices to take N>2n log q [19]. For RLWE, it does not necessarily work just to take N>2n log q=2 log q due to subtle distributional issues—in particular, the problem is that R_(q) may have many zero divisors. Micciancio's regularity lemma [17] assures us that if AεR_(q) ^(N×(n+1)) and rεR₂ ^(N) are uniform, then A^(T)r has negligible statistical distance from uniform when N=log(q·λ^(ω(1))). Lyubashevsky et al. [15] (full version of the paper) give a stronger result when all of the ring elements in the matrix A are in R_(q)* (non-zero-divisors)—namely, the distribution is within 2^(−Ω(d)) of uniform when the ring elements in the r are chosen from a discrete Gaussian distribution of width dq^(1/N). (Using this result would necessitate some changes to the encryption scheme above.)

While we think our description of encryption above is useful in that it highlights the high-level similarity of LWE and RLWE, the distributional issues discussed above make it more desirable, in practice, to use a slightly different approach to encryption in the RLWE setting. In particular, Lyubashevsky et al. [15] streamline public key generation and encryption in the RLWE setting as follows:

-   -   E.PublicKeyCen(params,sk): As above, except N=1.     -   E.Enc(params, pk,m): To encrypt a message mεR₂, set         m←(m,0)εR_(q) ², sample r←χ and e←χ². Output the ciphertext         c←m+2·e+A^(T)·rεR_(q) ². (That is cεR_(q) ² is the sum of m, a         small even vector, and r (a small ring element) times the single         encryption of zero given in the public key (namely A^(T)).

The security of LPR encryption relies on RLWE: assuming RLWE, if A^(T) were uniform in R_(q) ², then the two ring elements m+a₁·r+e₁ and a₂·r+e₂ of the ciphertext generated during encryption are pseudorandom.

Below, sometimes other functions will invoke the function E.PublicKeyGen(params,sk,N) with an integer parameter N. In that case, it invokes the first version of E.PublicKeyGen (not the LPR version) with the specified value of N.

3.2 Key Switching (Dimension Reduction)

We start by reminding the reader that in the basic GLWE-based encryption scheme above, the decryption equation for a ciphertext c that encrypts m under key s can be written as m=[[L_(c)(s)]_(q)]₂ where L_(c)(x) is a ciphertext-dependent linear equation over the coefficients of x given by L_(c)(x)=

c,x

.

Suppose now that we have two ciphertexts c₁ and c₂, encrypting m₁ and m₂ respectively under the same secret key s. The way homomorphic multiplication is accomplished in [3] is to consider the quadratic equation Q_(c) ₁ _(,c) ₂ (x)←L_(c) ₁ (x)·L_(c) ₂ (x). Assuming the noises of the initial ciphertexts are small enough, we obtain m₁·m₂=[Q_(c) ₁ _(,c) ₂ (s)]_(q)]₂, as desired. If one wishes, one can view Q_(c) ₁ _(,c) ₂ (x) as a linear equation L_(c) ₁ _(,c) ₂ ^(long)(x

x) over the coefficients of x

x—that is, the tensoring of x with itself—where x

x's dimension is roughly the square of x's. Using this interpretation, the ciphertext represented by the coefficients of the linear equation L^(long) is decryptable by the long secret key s₁

s₁ via the usual dot product. Of course, we cannot continue increasing the dimension like this indefinitely and preserve efficiency.

Thus, Brakerski and Vaikuntanathan convert the long ciphertext represented by the linear equation L^(long) and decryptable by the long tensored secret key s₁

s₁ into a shorter ciphertext c₂ that is decryptable by a different secret key s₂. (The secret keys need to be different to avoid a “circular security” issue). Encryptions of s₁

s₁ under s₂ are provided in the public key as a “hint” to facilitate this conversion.

We observe that Brakerski and Vaikuntanathan's relinearization/dimension reduction procedures are actually quite a bit more general. They can be used to not only reduce the dimension of the ciphertext, but more generally, can be used to transform a ciphertext c₁ that is decryptable under one secret key vector s₁ to a different ciphertext c₂ that encrypts the same message, but is now decryptable under a second secret key vector s₂. The vectors c₂,s₂ may not necessarily be of lower degree or dimension than c₁,s₁.

Below, we review the concrete details of Brakerski and Vaikuntanathan's key switching procedures. The procedures will use some subroutines that, given two vectors c and s, “expand” these vectors to get longer (higher-dimensional) vectors c′ and s′such that

c′,s′

=

c,s

mod q. We describe these subroutines first.

-   -   BitDecomp (xεR_(q) ^(n),q) decomposes x into its bit         representation. Namely, write

$x = {\sum\limits_{j = 0}^{\lfloor{\log\; q}\rfloor}\;{2^{j} \cdot u_{j}}}$ with all u_(j)εR₂ ^(n). Output (u₀,u₁, . . . , u_(└log q┘))εR₂ ^(n·┌log q┐).

-   -   Powersof2 (xεR_(q) ^(n),q) outputs (x, 2·x, . . . ,         2^(└log q┘)·x)εR_(q) ^(n·┌log q┐).

If one knows a priori that x has coefficients in [0,B] for B=q, then BitDecomp can be optimized in the obvious way to output a shorter decomposition in R₂ ^(n·┌log B┐). Observe that:

Lemma 2

For vectors c,s of equal length, we have

BitDecomp(c,q),Powersof2(s,q)

=

c,s

mod q.

Proof.

Trivial.

We remark that this obviously generalizes to decompositions with respect to bases other than the powers of 2.

Now, key switching consists of two procedures: first, a procedure SwitchKeyGen(s₁,s₂,n₁,n₂,q), which takes as input the two secret key vectors, the respective dimensions of these vectors, and the modulus q, and outputs some auxiliary information τ_(s) ₁ _(→s) ₂ that enables the switching; and second, a procedure SwitchKey(τ_(s) ₁ _(→s) ₂ , c₁, n₁, n₂, q), that takes this auxiliary information and a ciphertext encrypted under s₁ and outputs a new ciphertext c₂ that encrypts the same message under the secret key s₂. (Below, we often suppress the additional arguments n₁,n₂,q.)

SwitchKeyGen(s₁εR_(q) ^(n) ¹ ,s₂εR_(q) ^(n) ² ):

-   -   1. Run A←E.PublicKeyGen(s₂,N) for N=n₁·┌log q┐.     -   2. Set B←A+Powersof2(s₁) (Add Powersof2(s₁)εR_(q) ^(N) to A's         first column.) Output τ_(s) ₁ _(→s) ₂ =B.

SwitchKey(τ_(s) ₁ _(→s) ₂ ,c₁): Output c₂=BitDecomp(c₁)^(T)·BεR_(q) ^(N) ² .

Note that, in SwitchKeyGen, the matrix A basically consists of encryptions of 0 under the key s₂. Then, pieces of the key s₁ are added to these encryptions of 0. Thus, in some sense, the matrix B consists of encryptions of pieces of s₁ (in a certain format) under the key s₂. We now establish that the key switching procedures are meaningful, in the sense that they preserve the correctness of decryption under the new key.

Lemma 3 [Correctness]

Let s₁,s₂,q, A,B=τ_(s) ₁ _(→s) ₂ be as in SwitchKeyGen(s₁,s₂), and let A·s₂=2e₂εR_(q) ^(N). Let c₁εR_(q) ^(n) ¹ and c₂←SwitchKey(τ_(s) ₁ _(→s) ₂ ,c₁). Then,

c ₂ ,s ₂

=2

BitDecomp(c ₁),e ₂

+

c ₁ ,s ₁

mod q

Proof.

$\begin{matrix} {\left\langle {c_{2},s_{2}} \right\rangle = {{{BitDecomp}\left( c_{1} \right)}^{T} \cdot B \cdot s_{2}}} \\ {= {{{BitDecomp}\left( c_{1} \right)}^{T} \cdot \left( {{2e_{2}} + {{Powersof}\; 2\left( s_{1} \right)}} \right)}} \\ {= {{2\left\langle {{{BitDecomp}\left( c_{1} \right)},e_{2}} \right\rangle} + \left\langle {{{BitDecomp}\left( c_{1} \right)},{{Powersof}\; 2\left( s_{1} \right)}} \right\rangle}} \\ {= {{2\left\langle {{{BitDecomp}\left( c_{1} \right)},e_{2}} \right\rangle} + \left\langle {c_{1},s_{1}} \right\rangle}} \end{matrix}$

Note that the dot product of BitDecomp(c₁) and e₂ is small, since BitDecomp(c₁) is in R₂ ^(N). Overall, we have that c₂ is a valid encryption of m under key s₂, with noise that is larger by a small additive factor.

Again, the processes above are adapted to the plaintext space R₂, but are easy to generalize.

3.3 Modulus Switching

Suppose c is a valid encryption of m under s modulo q (i.e., m=[[

c,s

]_(q)]₂) and that s is a short vector. Suppose also that c′ is basically a simple scaling of c—in particular, c′ is the R-vector closest to (p/q)·c such that c′=c mod 2. Then, it turns out (subject to some qualifications) that c′ is a valid encryption of m under s modulo p using the usual decryption equation—that is, m=[[

c′,s

]_(p)]₂! In other words, we can change the inner modulus in the decryption equation—e.g., to a smaller number—while preserving the correctness of decryption under the same secret key! The essence of this modulus switching idea, a variant of Brakerski and Vaikuntanathan's modulus reduction technique, is formally captured in Lemma 4 below.

Definition 6 (Scale)

For integer vector x and integers q>p>m, we define x′←Scale(x,q,p,r) to be the R-vector closest to (p/q)·x that satisfies x′=x mod r.

Definition 7 (l₁ ^((R)) Norm)

The (usual) norm l₁(s) over the reals equals Σ_(i)∥s[i]∥. We extend this to our ring R as follows: l₁ ^((R)) (s) for sεR^(n) is defined as Σ_(i)∥s[i]∥.

Lemma 4

Let d be the degree of the ring (e.g., d=1 when R=Z). Let q>p>r be positive integers satisfying q=p=1 mod r. Let cεR^(n) and c′←Scale(c,q,p,r). Then, for any sεR^(n) with ∥[

c,s

]_(q)∥<q/2−(q/p)·γ_(R)·(r/2)·√{square root over (d)}·l₁ ^((R))(s), we have [

c′,s

] _(p) =[

c,s

] _(q) mod r and [

c′,s

] _(p)∥<(p/q)·∥[

c,s

] _(q)∥+γ_(R)·(r/2)·√{square root over (d)}·l ₁ ^((R))(s)

Proof.

(Lemma 4) We have [

c,s

] _(q) =

c,s

−kq for some kεR. For the same k, let e _(p) =

c′,s

−kpεR Note that e_(p)=[

c′,s

]_(p) mod p. We claim that ∥e_(p)∥ is so small that e_(p)=[

c′,s

]_(p). We have:

${e_{p}} = {{{{- {kp}} + \left\langle {{\left( {p/q} \right) \cdot c},s} \right\rangle + \left\langle {{c^{\prime} - {\left( {p/q} \right) \cdot c}},s} \right\rangle}} \leq {{{{- {kp}} + \left\langle {{\left( {p/q} \right) \cdot c},s} \right\rangle}} + {\left\langle {{c^{\prime} - {\left( {p/q} \right) \cdot c}},s} \right\rangle }} \leq {{\left( {p/q} \right) \cdot {\left\lbrack \left\langle {c,s} \right\rangle \right\rbrack_{q}}} + {\gamma_{R} \cdot {\sum\limits_{j = 1}^{n}{{{{c^{\prime}\lbrack j\rbrack} - {\left( {p/q} \right) \cdot {c\lbrack j\rbrack}}}} \cdot {{s\lbrack j\rbrack}}}}}} \leq {{\left( {p/q} \right) \cdot {\left\lbrack \left\langle {c,s} \right\rangle \right\rbrack_{q}}} + {\gamma_{R} \cdot \left( {r/2} \right) \cdot \sqrt{d} \cdot {\ell_{1}^{(R)}(s)}}} < {p/2}}$ Furthermore, modulo r, we have [

c′,s

]_(p)=e_(p)=

c′,s

−kp=

c,s

−kq=[

c,s

]_(q).

The lemma implies that an evaluator, who does not know the secret key but instead only knows a bound on its length, can potentially transform a ciphertext c that encrypts m under key s for modulus q—i.e., m=[[

c,s

]_(q)]_(r)—into a ciphertext c that encrypts m under the same key s for modulus p—i.e., m=[[

c,s

]_(p)]_(r). Specifically, the following corollary follows immediately from Lemma 4.

Corollary 1

Let p and q be two odd moduli. Suppose c is an encryption of bit m under key s for modulus q—i.e., m=[[

c′,s

]_(q)]_(r). Moreover, suppose that s is a fairly short key and the “noise” e_(q)←[

c,s

]_(q) has small magnitude—precisely, assume that ∥e_(q)∥<q/2−(q/p)·(r/2)·√{square root over (d)}·γ_(R)·l₁ ^((R))(s). Then c′←Scale(c,q,p,r) is an encryption of bit m under key s for modulus p—i.e., m=[[

c,s

]_(p)]_(r). The noise e_(p)=[

c′,s

]_(p) of the new ciphertext has magnitude at most (p/q)·∥[

c,s

]_(q)∥+γ_(R)·(r/2)·√{square root over (d)}·l₁ ^((R))(s).

Amazingly, assuming p is smaller than q and s has coefficients that are small in relation to q, this trick permits the evaluator to reduce the magnitude of the noise without knowing the secret key! (Of course, this is also what Gentry's bootstrapping transformation accomplishes, but in a much more complicated way.)

3.4 (Leveled) FHE Based on GLWE without Bootstrapping

We now present our FHE scheme. Given the machinery that we have described in the previous subsections, the scheme itself is remarkably simple.

In our scheme, we will use a parameter L indicating the number of levels of arithmetic circuit that we want our FHE scheme to be capable of evaluating. Note that this is an exponential improvement over prior schemes, that would typically use a parameter d indicating the degree of the polynomials to be evaluated.

(Note: the linear polynomial L^(long), used below, is defined in Section 3.2.)

Our FHE Scheme without Bootstrapping:

-   -   FHE.Setup(1^(λ),1^(L),b): Takes as input the security parameter,         a number of levels L, and a bit b. Use the bit bε{0,1} to         determine whether we are setting parameters for a LWE-based         scheme (where d=1) or a RLWE-based scheme (where n=1). Let         μ=μ(λ,L,b)=θ(log λ+log L) be a parameter that we will specify in         detail later. For j=L (input level of circuit) to 0 (output         level), run params_(j)←E.Setup(1^(λ),1^(j+1)·μ),b) to obtain a         ladder of parameters, including a ladder of decreasing moduli         from q_(L)((L+1)·μ bits) down to q₀ (μ bits). (The ring degree         d_(j), dimension n_(j), and noise distribution χ_(j) do not         necessarily need to vary (decrease) with the circuit level. In         the procedure below, we allow n_(j) and χ_(j) to vary, but defer         the case of decreasing d_(j).)     -   FHE.KeyGen({params_(j)}): For j=L down to 0, do the following:     -   (1) Run s_(j)←E.SecretKeyGen(params_(j)) and         A_(j)←E.PublicKeyGen(params_(j),s_(j)).     -   (2) Set s_(j′)←s_(j)         s_(j)εR_(q) _(j) ^(n) ^(j) ⁺¹ ² . That is, s_(f′) is a tensoring         of s_(j) with itself whose coefficients are each the product of         two coefficients of s_(j) in R_(q) _(j) .     -   (3) Run τ_(s) _(j+1′) _(→s) _(j) ←SwitchKeyGen(s_(j+1)′,s_(j)).         (Omit this step when j=L.)

The secret key sk consists of the s_(j)'s and the public key pk consists of the A_(j)'s and τ_(s) _(j+1′) _(→s) _(j) 's.

-   -   FHE.Enc(params, pk,m): Take a message in R₂. Run         E.Enc(params_(L),A_(L),m).     -   FHE.Dec(params,sk,c): Suppose the ciphertext is under key s_(j).         Run E.Dec(params_(j),s_(j),c). (The ciphertext could be         augmented with an index indicating which level it belongs to.)     -   FHE.Add(pk,c₁,c₂): Takes two ciphertexts encrypted under the         same s_(j). (If needed, use FHE.Refresh (below) to make it so.)         Set c₃←c₁+c₂ mod q_(j). Interpret c₃ as a ciphertext under         s_(j′) (s_(j′)'s coefficients include all of s_(j)'s since         s_(j′)=s_(j)         s_(j) and s_(j)'s first coefficient is 1) and output:         c ₄←FHE.Refresh(c ₃,τ_(s) _(j′) _(→s) _(j−1) ,q _(j) ,q _(j−1))     -   FHE.Mult(pk,c₁,c₂): Takes two ciphertexts encrypted under the         same s_(j). If needed, use FHE.Refresh (below) to make it so.)         First, multiply: the new ciphertext, under the secret key         s_(j′)=s_(j)         s_(j), is the coefficient vector c₃ of the linear equation L_(c)         ₁ _(,c) ₂ ^(Long)(x         x). Then, output:         c ₄←FHE.Refresh(c ₃,τ_(s) _(j′) _(→s) _(j−1) ,q _(j) ,q _(j−1)     -   FHE.Refresh(c, τ_(s) _(j′) _(→s) _(j−1) ,q_(j),q_(j−1)): Takes a         ciphertext encrypted under s_(j′), the auxiliary information to         facilitate key switching, and the current and next moduli q_(j)         and q_(j−1). Do the following:     -   (1) Switch Keys: Set c₁←SwitchKey(τ_(s) _(j′) _(→s) _(j−1)         ,c,q_(j)), a ciphertext under the key s_(j−1) for modulus q_(j).     -   (2) Switch Moduli: Set c₂ Scale(c₁,q_(j),q_(j−1),2), a         ciphertext under the key s_(j−1) for modulus q_(j−1).

Remark 1

We mention the obvious fact that, since addition increases the noise much more slowly than multiplication, one does not necessarily need to refresh after additions, even high fan-in ones.

The key step of our new FHE scheme is the Refresh procedure. If the modulus q_(j−1) is chosen to be smaller than q_(j) by a sufficient multiplicative factor, then Corollary 1 implies that the noise of the ciphertext output by Refresh is smaller than that of the input ciphertext—that is, the ciphertext will indeed be a “refreshed” encryption of the same value. We elaborate on this analysis in the next section.

One can reasonably argue that this scheme is not “FHE without bootstrapping” since τ_(s) _(j′) _(→S) _(j−1) can be viewed as an encrypted secret key, and the SwitchKey step can viewed as a homomorphic evaluation of the decryption function. We prefer not to view the SwitchKey step this way. While there is some high-level resemblance, the low-level details are very different, a difference that becomes tangible in the much better asymptotic performance. To the extent that it performs decryption, SwitchKey does so very efficiently using an efficient (not bit-wise) representation of the secret key that allows this step to be computed in quasi-linear time for the RLWE instantiation, below the quadratic lower bound for bootstrapping. Certainly SwitchKey does not use the usual ponderous approach of representing the decryption function as a boolean circuit to be traversed homomorphically. Another difference is that the SwitchKey step does not actually reduce the noise level (as bootstrapping does); rather, the noise is reduced by the Scale step.

4 Correctness, Setting the Parameters, Performance, and Security

Here, we will show how to set the parameters of the scheme so that the scheme is correct. Mostly, this involves analyzing each of the steps within FHE.Add and FHE.Mult—namely, the addition or multiplication itself, and then the SwitchKey and Scale steps that make up FHE.Refresh—to establish that the output of each step is a decryptable ciphertext with bounded noise. This analysis will lead to concrete suggestions for how to set the ladder of moduli and to asymptotic bounds on the performance of the scheme.

Let us begin by considering how much noise FHE.Enc introduces initially. Throughout, B_(χ) denotes a bound such that R-elements sampled from the noise distribution χ have length at most B_(χ) with overwhelming probability.

4.1 The Initial Noise from FHE.Enc

Recall that FHE.Enc simply invokes E.Enc for suitable parameters (params_(L)) that depend on λ and L. In turn, the noise of ciphertexts output by E.Enc depends on the noise of the initial “ciphertext(s)” (the encryption(s) of 0) implicit in the matrix A output by E.PublicKeyCen, whose noise distribution is dictated by the distribution χ.

Lemma 5

Let q, d, n, N be the parameters associated to FHE.Enc. Let γ_(R) be the expansion factor associated to R. (γ_(R) and d are both 1 in the LWE case R=Z.) The length of the noise in ciphertexts output by FHE.Enc is at most √{square root over (d)}+2·γ_(R)·√{square root over (d)}·N·B_(χ).

Proof.

We have A·s=2e where s←E.SecretKeyGen, A←E.PublicKeyGen(s,N), and e←χ^(N). Recall that encryption works as follows: c←m+A^(T)r mod q where rεR₂ ^(N). We have that the noise of this ciphertext is [

c,s

]_(q)=[m+2

r,e

]_(q). The magnitude of this element is at most

${\sqrt{d} + {2 \cdot \gamma_{R} \cdot {\sum\limits_{j = 1}^{N}\;{{{r\lbrack j\rbrack}} \cdot {{e\lbrack j\rbrack}}}}}} \leq {\sqrt{d} + {2 \cdot \gamma_{R} \cdot \sqrt{d} \cdot N \cdot {B_{ϰ}.}}}$

One can easily obtain a similar small bound on the noise of ciphertexts output by LPR encryption in the RLWE setting: a small polynomial in the security parameter λ, L, and log q.

The correctness of decryption for ciphertexts output by FHE.Enc, assuming the noise bound above is less than q/2, follows directly from the correctness of the basic encryption and decryption algorithms E.Enc and E.Dec.

4.2 Correctness and performance of FHE.Add and FHE.Mult (BEFORE FHE.Refresh)

Consider FHE.Mult. One begins FHE.Mult(pk,c₁,c₂) with two ciphertexts under key s_(j) for modulus q_(j) that have noises e_(i)=[L_(c) _(i) (s_(j))]_(q) _(j) , where L_(c) _(i) (x) is simply the dot product

c_(i),x

. To multiply together two ciphertexts, one multiplies together these two linear equations to obtain a quadratic equation Q_(c) ₁ _(,c) ₂ (x)←L_(c) ₁ (x)·L_(c) ₂ (x), and then interprets this quadratic equation as a linear equation L_(c) ₁ _(,c) ₂ ^(long)(x

x)=Q_(c) ₁ _(,c) ₂ (x) over the tensored vector x

x. The coefficients of this long linear equation compose the new ciphertext vector c₃. Clearly, [

c₃,s_(j)

s_(j)

]_(q) _(j) =[L_(c) ₁ _(,c) ₂ ^(long)(s_(j)

s_(j))]_(q) _(j) =[e₁·e₂]_(q) _(j) . Thus, if the noises of c₁ and c₂ have length at most B, then the noise of c₃ has length at most γ_(R)·B², where γ_(R) is the expansion factor of R. If this length is less than q_(j)/2, then decryption works correctly. In particular, if m_(i)=[

c_(i),s_(j)

]_(q) _(j) ]₂=[e_(i)]₂ for iε{1,2}, then over R₂ we have [

c₃,s_(j)

s_(j)

]_(q) _(j) ]₂=[[e₁·e₂]_(q) _(j) ]₂=[e₁·e₂]₂=[e₁]₂·[e₂]₂=m₁·m₂. That is, correctness is preserved as long as this noise does not wrap modulo q_(j).

The correctness of FHE.Add and FHE.Mult, before the FHE.Refresh step is performed, is formally captured in the following lemmas.

Lemma 6

Let c₁ and c₂ be two ciphertexts under key s_(j) for modulus q_(j), where ∥[

c_(i),s_(j)

]_(q) _(j) ∥≦B and m_(i)=[[

c_(i),s_(j)

]_(q) _(j) ]₂. Let s_(j′)=s_(j)

s_(j), where the “non-quadratic coefficients” of s_(f′) (namely, the ‘1’ and the coefficients of s_(j)) are placed first. Let c′=c₁+c₂, and pad c′ with zeros to get a vector c₃ such that

c₃,s_(j′)

=

c′,s_(j)

. The noise [

c₃,s_(j′)

] has length at most 2B. If 2B<q_(j)/2, c₃ is an encryption of m₁+m₂ under key s_(j′) for modulus q_(j)—i.e., m₁·m₂=[[

c₃,s_(j′)

]_(q) _(j) ]₂.

Lemma 7

Let c₁ and c₂ be two ciphertexts under key s_(j) for modulus q_(j), where ∥[

c_(i),s_(j)

]_(q) _(j) ∥≦B and m_(i)=[[

c_(i),s_(j)

]_(q) _(j) ]₂. Let the linear equation L_(c) ₁ _(,c) ₂ ^(long)(x

x) be as defined above, let c₃ be the coefficient vector of this linear equation, and let s_(j′)=s_(j)

s_(j). The noise [

c₃,s_(j′)

)]_(q) _(j) has length at most γ_(R)·B². If γ_(R)·B²<q_(j)/2, c₃ is an encryption of m₁·m₂ under key s_(j′) for modulus q_(j)—i.e., m₁·m₂=[[

c₃,s_(j′)

]_(q) _(j) ]₂.

The computation needed to compute the tensored ciphertext c₃ is Õ(d_(j)n_(j) ² log q_(j)). For the RLWE instantiation, since n_(j)=1 and since (as we will see) d_(j) (resp. log q_(j)) depend only quasi-linearly (resp. logarithmically) on the security parameter and linearly (resp. linearly) on L, the computation here is only quasi-linear in the security parameter. For the LWE instantiation, the computation is quasi-quadratic.

4.3 Correctness and performance of FHE.Refresh

FHE.Refresh consists of two steps: Switch Keys and Switch Moduli. We address each of these steps in turn.

Correctness and Performance of the Switch-Key Step.

In the Switch Keys step, we take as input a ciphertext c under key s_(j′) for modulus q_(j) and set c₁←SwitchKey(τ_(s) _(j′) _(→s) _(j−1) ,c,q_(j)), a ciphertext under the key s_(j−1) for modulus q_(j). In Lemma 3, we proved the correctness of key switching and showed that the noise grows by the additive factor 2

BitDecomp(c,q_(j)),e

, where BitDecomp(c,q_(j)) is a (short) bit-vector and e is a (short and fresh) noise vector with elements sampled from χ. In particular, if the noise originally had length B, then after the Switch Keys step it has length at most

${{B + {2 \cdot \gamma_{R} \cdot B_{ϰ} \cdot {\sum\limits_{i = 1}^{w_{j}}\;{{{{BitDecomp}\left( {c,q_{j}} \right)}\lbrack i\rbrack}}}}} \leq {B + {2 \cdot \gamma_{R} \cdot B_{ϰ} \cdot w_{j} \cdot \sqrt{d_{j}}}}},$ where

$w_{j} \leq {\begin{pmatrix} {n_{j} + 1} \\ 2 \end{pmatrix} \cdot \left\lceil {\log\; q_{j}} \right\rceil}$ is the dimension of BitDecomp(c,q_(j)).

We capture the correctness of the Switch-Key step in the following lemma.

Lemma 8

Let c be a ciphertext under the key s_(f′)=s_(j)

s_(j) for modulus q_(j) such that e₁←[

c,s_(j′)

]_(q) _(j) has length at most B and m=[e₁]₂. Let c₁←SwitchKey(τ_(s) _(j′) _(→s) _(j−1) ,c,q_(j)), and let e₂=[

c₁,s_(j−1)

]q_(j). Then, e₂ (the new noise) has length at most B+2·γ_(R)·B_(χ)·n_(j)+1₂·┌log q_(j)┐·√{square root over (d_(j))} and (assuming this noise length is less than q_(j)/2) we have m=[e₂]₂.

The Switch-Key step involves multiplying the transpose of w_(j)-dimensional vector BitDecomp(c,q_(j)) with a w_(j)×(n_(j)+1) matrix B. This computation is Õ(d_(j)n_(j) ³ log² q_(j)). Still this is quasi-linear in the RLWE instantiation.

Correctness and Performance of the Switch-Moduli Step.

The Switch Moduli step takes as input a ciphertext c₁ under the secret bit-vector s_(j−1) for the modulus q_(j), and outputs the ciphertext c₂←Scale(c₁,q_(j),q_(j−1),2), which we claim to be a ciphertext under key s_(j−1) for modulus q_(j−1). Note that s_(j−1) is a short secret key. By Corollary 1, and using the fact that l₁(s_(j−1))≦(n_(j−1)+1). B_(χ), the following is true: if the noise of c₁ has length at most B<q_(j)/2−(q_(j)/q_(j−1))·√{square root over (d_(j))}·γ_(R)·(n_(j−1)+1)·B_(χ), then correctness is preserved and the noise of c₂ is bounded by (q_(j−i)/q_(j))·B+√{square root over (d_(j))}·γ_(R)·(n_(j−1)+1)·B_(χ). Of course, the key feature of this step for our purposes is that switching moduli may reduce the length of the moduli when q_(j−1)<q_(j).

We capture the correctness of the Switch-Moduli step in the following lemma.

Lemma 9

Let c₁ be a ciphertext under the key s_(j−1), sampled from χ^(n) ^(j−1) , such that e_(j)←[

c₁,s_(j−1)

]_(q) _(j) has length at most B and m=[e_(j)]₂. Let c₂←Scale(c₁,q_(j),q_(j−1),2) and let e_(j−1)=[

c₂,s_(j−1)

]_(q) _(j−1) . Then, e_(j−1) (the new noise) has length at most (q_(j−1)/q_(j))·B+√{square root over (d_(j))}·γ_(R)·(n_(j−1)+1)·B_(χ), and (assuming this noise length is less than q_(j−1)/2) we have m=[e_(j−1)]₂.

The computation in the Switch-Moduli step is Õ(d_(j)n_(j−1) log q_(j)).

4.4 Putting the Pieces Together: Parameters, Correctness, Performance

So far we have established that the scheme is correct, assuming that the noise does not wrap modulo q_(j) or q_(j−1). Now we need to show that we can set the parameters of the scheme to ensure that such wrapping never occurs.

Our strategy for setting the parameters is to pick a “universal” bound B on the noise length, and then prove, for all j, that a valid ciphertext under key s_(j) for modulus q_(j) has noise length at most B. This bound B is quite small: polynomial in λ and log q_(L), where q_(L) is the largest modulus in our ladder. It is clear that such a bound B holds for fresh ciphertexts output by FHE.Enc. (Recall the discussion from Section 3.1 where we explained that we use a noise distribution χ that is essentially independent of the modulus.) The remainder of the proof is by induction—i.e., we will show that if the bound holds for two ciphertexts c₁,c₂ at level j, our lemmas above imply that the bound also holds for the ciphertext c′←FHE.Mult(pk,c₁,c₂) at level j−1. (FHE.Mult increases the noise strictly more in the worst-case than FHE.Add for any reasonable choice of parameters.)

Specifically, after the first step of FHE.Mult (without the Refresh step), the noise has length at most γ_(R)·B². Then, we apply the SwitchKey function, which introduces an additive term η_(SwitchKey,j). Finally, we apply the Scale function. The noise is now at most (q _(j−1) /q _(j))·(γ_(R) ·B ²+η_(SwitchKey,j))+η_(Scale,j) where η_(Scale,j) is another additive term. Now we want to choose our parameters so that this bound is at most B.

Suppose we set our ladder of moduli and the bound B such that the following two properties hold:

-   -   Property 1: B≧2·(η_(Scale,j)+η_(SwitchKey,j)) for all j.     -   Property 2: q_(j)/q_(j−1)≧2·B·γ_(R) for all j.

Then we have

${{\left( {q_{j - 1}/q_{j}} \right) \cdot \left( {{\gamma_{R} \cdot B^{2}} + \eta_{{SwitchKey},j}} \right)} + \eta_{{Scale},j}} < {{\left( {q_{j - 1}/q_{j}} \right) \cdot \gamma_{R} \cdot B^{2}} + \eta_{{Scale},j} + \eta_{{SwitchKey},j}} \leq {{\frac{1}{2 \cdot B \cdot \gamma_{R}} \cdot \gamma_{R} \cdot B^{2}} + {\frac{1}{2} \cdot B}} \leq B$ It only remains to set our ladder of moduli and B so that Properties 1 and 2 hold.

Unfortunately, there is some circularity in Properties 1 and 2: q_(L) depends on B, which depends on q_(L), albeit only polylogarithmically. However, it is easy to see that this circularity is not fatal. As a non-optimized example to illustrate this, set B=λ^(a)·L^(b) for very large constants a and b, and set q_(j)≈2^((j+1)·ω·(log λ+log L)). If a and b are large enough, B dominates n_(Scale,L)+η_(SwitchKey,L), which is polynomial in λ and log q_(L), and hence polynomial in λ and L (Property 1 is satisfied). Since q_(j)/q_(j−1) is super-polynomial in both λ and L, it dominates 2·B·γ_(R) (Property 2 is satisfied). In fact, it works fine to set q_(j) as a modulus having (j+1)·μ bits for some μ=θ(log λ+log L) with small hidden constant.

Overall, we have that q_(L), the largest modulus used in the system, is θ(L·(log λ+log L)) bits, and d_(L)·n_(L) must be approximately that number times λ for 2^(λ) security.

Theorem 3

For some μ=″(log λ+log L), FHE is a correct L-leveled FHE scheme—specifically, it correctly evaluates circuits of depth L with Add and Mult gates over R₂. The per-gate computation is Õ(d_(L)·n_(L) ³·log²q_(j))=Õ(d_(L)·n_(L) ³·L²). For the LWE case (where d=1), the per-gate computation is Õ(λ³·L⁵). For the RLWE case (where n=1), the per-gate computation is Õ(λ·L³).

The bottom line is that we have a RLWE-based leveled FHE scheme with per-gate computation that is only quasi-linear in the security parameter, albeit with somewhat high dependence on the number of levels in the circuit.

Let us pause at this point to reconsider the performance of previous FHE schemes in comparison to our new scheme. Specifically, as we discussed in the Introduction, in previous SWHE schemes, the ciphertext size is at least Õ(λ·d²), where d is the degree of the circuit being evaluated. One may view our new scheme as a very powerful SWHE scheme in which this dependence on degree has been replaced with a similar dependence on depth. (Recall the degree of a circuit may be exponential in its depth.) Since polynomial-size circuits have polynomial depth, which is certainly not true of degree, our scheme can efficiently evaluate arbitrary circuits without resorting to bootstrapping.

4.5 Security

The security of FHE follows by a standard hybrid argument from the security of E, the basic scheme described in Section 3.1. We omit the details.

5 Optimizations

Despite the fact that our new FHE scheme has per-gate computation only quasi-linear in the security parameter, we present several significant ways of optimizing it. We focus primarily on the RLWE-based scheme, since it is much more efficient.

Our first optimization is batching. Batching allows us to reduce the per-gate computation from quasi-linear in the security parameter to polylogarithmic. In more detail, we show that evaluating a function ƒ homomorphically in parallel on l=Ω(λ) blocks of encrypted data requires only polylogarithmically (in terms of the security parameter λ) more computation than evaluating ƒ on the unencrypted data. (The overhead is still polynomial in the depth L of the circuit computing ƒ.) Batching works essentially by packing multiple plaintexts into each ciphertext.

Next, we reintroduce bootstrapping as an optimization rather than a necessity (Section 5.2). Bootstrapping allows us to achieve per-gate computation quasi-quadratic in the security parameter, independent of the number levels in the circuit being evaluated.

In Section 5.3, we show that batching the bootstrapping function is a powerful combination. With this optimization, circuits whose levels mostly have width at least λ can be evaluated homomorphically with only Õ(λ) per-gate computation, independent of the number of levels.

Finally, Section 5.5 presents a few other miscellaneous optimizations.

5.1 Batching

Suppose we want to evaluate the same function ƒ on l blocks of encrypted data. (Or, similarly, suppose we want to evaluate the same encrypted function ƒ on l blocks of plaintext data.) Can we do this using less than l times the computation needed to evaluate ƒ on one block of data? Can we batch?

For example, consider a keyword search function that returns ‘1’ if the keyword is present in the data and ‘0’ if it is not. The keyword search function is mostly composed of a large number of equality tests that compare the target word w to all of the different subsequences of data; this is followed up by an OR of the equality test results. All of these equality tests involve running the same w-dependent function on different blocks of data. If we could batch these equality tests, it could significantly reduce the computation needed to perform keyword search homomorphically.

If we use bootstrapping as an optimization (see Section 5.2), then obviously we will be running the decryption function homomorphically on multiple blocks of data—namely, the multiple ciphertexts that need to be refreshed. Can we batch the bootstrapping function? If we could, then we might be able to drastically reduce the average per-gate cost of bootstrapping.

Smart and Vercauteren [23] were the first to rigorously analyze batching in the context of FHE. In particular, they observed that ideal-lattice-based (and RLWE-based) ciphertexts can have many plaintext slots, associated to the factorization of the plaintext space into algebraic ideals.

When we apply batching to our new RLWE-based FHE scheme, the results are pretty amazing. Evaluating ƒ homomorphically on l=Ω(λ) blocks of encrypted data requires only polylogarithmically (in terms of the security parameter λ) more computation than evaluating ƒ on the unencrypted data. (The overhead is still polynomial in the depth L of the circuit computing ƒ.) As we will see later, for circuits whose levels mostly have width at least λ, batching the bootstrapping function (i.e., batching homomorphic evaluation of the decryption function) allows us to reduce the per-gate computation of our bootstrapped scheme from Õ(λ²) to Õ(λ) (independent of L).

To make the exposition a bit simpler, in our RLWE-based instantiation where R=Z[x]/(x^(d)+1), we will not use R₂ as our plaintext space, but instead use a plaintext space R_(p), prime p=1 mod 2d, where we have the isomorphism R_(p)≅R_(p) ₁ × . . . ×R_(p) _(d) of many plaintext spaces (think Chinese remaindering), so that evaluating a function once over R_(p) implicitly evaluates the function many times in parallel over the respective smaller plaintext spaces. The p_(i)'s will be ideals in our ring R=Z[x]/(x^(d)+1). (One could still use R₂ as in [23], but the number theory there is a bit more involved.)

5.1.1 Some Number Theory

Let us take a very brief tour of algebraic number theory. Suppose p is a prime number satisfying p=1 mod 2d, and let a be a primitive 2d-th root of unity modulo p. Then, X^(d)+1 factors completely into linear polynomials modulo p—in particular,

${x^{d} + 1} = {\prod\limits_{i = 1}^{d}\;{\left( {x - a_{i}} \right){mod}\; p}}$ where a_(i)=a^(2i-1) mod p. In some sense, the converse of the above statement is also true, and this is the essence of reciprocity—namely, in the ring R=Z[x]/(x^(d)+1) the prime integer p is not actually prime, but rather it splits completely into prime ideals in R—i.e.,

$p = {\prod\limits_{i = 1}^{d}\;{\rho_{i}.}}$ The ideal p_(i) equals (p,x−a_(i))—namely, the set of all R-elements that can be expressed as r₁·p+r₂·(x−a_(i)) for some r₁,r₂εR. Each ideal p_(i) has norm p—that is, roughly speaking, a 1/p fraction of R-elements are in p_(i), or, more formally, the p cosets 0+p_(i), . . . , (p−1)+p_(i) partition R. These ideals are relative prime, and so they behave like relative prime integers. In particular, the Chinese Remainder Theorem applies: R_(p)≅R_(p) ₁ × . . . ×R_(p) _(d) .

Although the prime ideals {p_(i)} are relatively prime, they are close siblings, and it is easy, in some sense, to switch from one to another. One fact that we will use (when we finally apply batching to bootstrapping) is that, for any i,j there is an automorphism σ_(i→j) over R that maps elements of p_(i) to elements of p_(j). Specifically, σ_(i→j) works by mapping an R-element r=r(x)=r_(d-1)x^(d-1)+ . . . +r₁x+r₀ to r(x^(e) ^(ij) )=r_(d-1)x^(e) ^(ij) ^((d-1)mod 2d)+ . . . +r₁x^(e) ^(ij) +r₀ where e_(ij) is some odd number in [1,2d]. Notice that this automorphism just permutes the coefficients of r and fixes the free coefficient. Notationally, we will use σ_(i→j)(v) to refer to the vector that results from applying σ_(i→j) coefficient-wise to v.

5.1.2 How Batching Works

Deploying batching inside our scheme FHE is quite straightforward. First, we pick a prime p=1 mod 2d of size polynomial in the security parameter. (One should exist under the GRH.)

The next step is simply to recognize that our scheme FHE works just fine when we replace the original plaintext space R₂ with R_(p). There is nothing especially magical about the number 2. In the basic scheme E described in Section 3.1, E.PublicKeyCen(params,sk) is modified in the obvious way so that A·s=p·e rather than 2·e. (This modification induces a similar modification in SwitchKeyGen.) Decryption becomes m=[[

c,s

]_(q)]_(p), Homomorphic operations use mod-p gates rather than boolean gates, and it is easy (if desired) to emulate boolean gates with mod-p gates—e.g., we can compute XOR (a,b) for a,b ε{0,1}² using mod-p gates for any p as a+b−2ab. For modulus switching, we use Scale(c₁,q_(j),q_(j−1),p) rather than Scale(c₁,q_(j),q_(j−1),2). The larger rounding error from this new scaling procedure increases the noise slightly, but this additive noise is still polynomial in the security parameter and the number of levels, and thus is still consistent with our setting of parameters. In short, FHE can easily be adapted to work with a plaintext space R_(p) for p of polynomial size.

The final step is simply to recognize that, by the Chinese Remainder Theorem, evaluating an arithmetic circuit over R_(p) on input xεR_(p) ^(n) implicitly evaluates, for each i, the same arithmetic circuit over R_(p) _(i) on input x projected down to R_(p) _(i) ^(n). The evaluations modulo the various prime ideals do not “mix” or interact with each other.

Theorem 4

Let p=1 mod 2d be a prime of size polynomial in λ. The RLWE-based instantiation of FHE using the ring R=Z[x]/(x^(d)+1) can be adapted to use the plaintext space R_(p)=

_(i=1) ^(d)R_(p) _(i) while preserving correctness and the same asymptotic performance. For any boolean circuit ƒ of depth L, the scheme can homomorphically evaluate ƒ on sets of inputs with per-gate computation Õ(λ·L³/min{d,l}).

When l≧λ, the per-gate computation is only polylogarithmic in the security parameter (still cubic in L).

5.2 Bootstrapping as an Optimization

Bootstrapping is no longer strictly necessary to achieve leveled FHE. However, in some settings, it may have some advantages:

-   -   Performance: The per-gate computation is independent of the         depth of the circuit being evaluated.     -   Flexibility: Assuming circular security, a bootstrapped scheme         can perform homomorphic evaluations indefinitely without needing         to specify in advance, during Setup, a bound on the number of         circuit levels.     -   Memory: Bootstrapping permits short ciphertexts—e.g., encrypted         using AES other space-efficient cryptosystem—to be de-compressed         to longer ciphertexts that permit homomorphic operations.         Bootstrapping thus allows us to save memory by storing data         encrypted in the compressed form, while retaining the ability to         perform homomorphic operations.

Here, we revisit bootstrapping, viewing it as an optimization rather than a necessity. We also reconsider the scheme FHE that we described in Section 3, viewing the scheme not as an end in itself, but rather as a very powerful SWHE whose performance degrades polynomially in the depth of the circuit being evaluated, as opposed to previous SWHE schemes whose performance degrades polynomially in the degree. In particular, we analyze how efficiently it can evaluate its decryption function, as needed to bootstrap. Not surprisingly, our faster SWHE scheme can also bootstrap faster. The decryption function has only logarithmic depth and can be evaluated homomorphically in time quasi-quadratic in the security parameter (for the RLWE instantiation), giving a bootstrapped scheme with quasi-quadratic per-gate computation overall.

5.2.1 Decryption as a Circuit of Quasi-Linear Size and Logarithmic Depth

Recall that the decryption function is m=[[

c,s

]_(q)]₂. Suppose that we are given the “bits” (elements in R₂) of s as input, and we want to compute [[

c,s

]_(q)]₂ using an arithmetic circuit that has Add and Mult gates over R₂. (When we bootstrap, of course we are given the bits of s in encrypted form.) Note that we will run the decryption function homomorphically on level-0 ciphertexts—i.e., when q is small, only polynomial in the security parameter. What is the complexity of this circuit? Most importantly for our purposes, what is its depth and size? The answer is that we can perform decryption with Õ(λ) computation and O(log λ) depth. Thus, in the RLWE instantiation, we can evaluate the decryption function homomorphically using our new scheme with quasi-quadratic computation. (For the LWE instantiation, the bootstrapping computation is quasi-quartic.)

First, let us consider the LWE case, where c and s are n-dimensional integer vectors. Obviously, each product c[i]·s[i] can be written as the sum of at most log q “shifts” of s[i]. These horizontal shifts of s[i] use at most 2 log q columns. Thus,

c,s

can be written as the sum of n·log q numbers, where each number has 2 log q digits. As discussed in [8], we can use the three-for-two trick, which takes as input three numbers in binary (of arbitrary length) and outputs (using constant depth) two binary numbers with the same sum. Thus, with O(log(n·log q))=O(log n+log log q) depth and O(n log² q) computation, we obtain two numbers with the desired sum, each having O(log n+log q) bits. We can sum the final two numbers with O(log log n+log log q) depth and O(log n+log q) computation. So far, we have used depth O(log n+log log q) and O(n log² q) computation to compute

c,s

. Reducing this value modulo q is an operation akin to division, for which there are circuits of size poly log(q) and depth log log q. Finally, reducing modulo 2 just involves dropping the most significant bits. Overall, since we are interested only in the case where log q=O(log λ), we have that decryption requires Õ(λ) computation and depth O(log λ).

When we evaluate decryption homomorphically in our RLWE-based scheme, we can use the R₂ plaintext space to emulate the simpler plaintext space Z₂. Using Z₂ the analysis is basically the same as above, except that we mention that the DFT is used to multiply the R-elements that compose the ciphertexts. We note that we can use the techniques of Section 4 to make the final ring dimension d₀ completely independent of the depth needed to evaluate the decryption circuit. However, we could bootstrap even without this optimization, as the depth of decryption only grows logarithmically with d₀=d_(L), whereas the number of levels that can be evaluated grows linearly with d_(L).

In practice, one would want to tighten up this analysis by reducing the polylogarithmic factors in the computation and the constants in the depth. Most likely this could be done by evaluating decryption using symmetric polynomials [8, 9] or with a variant of the “grade-school addition” approach used in the Gentry-Halevi implementation [10].

5.2.2 Bootstrapping Lazily

Bootstrapping is rather expensive computationally. In particular, the cost of bootstrapping a ciphertext is greater than the cost of a homomorphic operation by approximately a factor of λ. This suggests the question: can we lower per-gate computation of a bootstrapped scheme by bootstrapping lazily—i.e., applying the refresh procedure only at a 1/L fraction of the circuit levels for some well-chosen L [12]? Here we show that the answer is yes. By bootstrapping lazily for L=θ(log λ), we can lower the per-gate computation by a logarithmic factor.

Let us present this result somewhat abstractly. Suppose that the per-gate computation for a L-level no-bootstrapping FHE scheme is ƒ(λ,L)=λ^(a) ¹ ·L^(a) ² . (We ignore logarithmic factors in ƒ, since they will not affect the analysis, but one can imagine that they add a very small ε to the exponent.) Suppose that bootstrapping a ciphertext requires a c-depth circuit. Since we want to be capable of evaluating depth L after evaluating the c levels need to bootstrap a ciphertext, the bootstrapping procedure needs to begin with ciphertexts that can be used in a (c+L)-depth circuit. Consequently, let us say that the computation needed a bootstrap a ciphertext is g(λ,c+L) where g(λ,x)=λ^(b) ¹ ·x^(b) ² . The overall per-gate computation is approximately g(λ,L)+g(λ,c+L)/L, a quantity that we seek to minimize.

We have the following lemma.

Lemma 10

Let ƒ(λ,L)=λ^(a) ¹ ·L^(a) ² and g(λ,L)=λ^(b) ¹ ·L^(b) ² for constants b₁>a₁ and b₂>a₂≧1. Let h(λ,L)=ƒ(λ,L)+g(λ,c+L)/L for c=θ(log λ). Then, for fixed λ, h(λ,L) has a minimum for Lε[(c−1)/(b₂−1),c/(b₂−1)]—i.e., at some L=θ(log λ).

Proof.

Clearly h(λ, L)=+∞ at L=0, then it decreases toward a minimum, and finally it eventually increases again as L goes toward infinity. Thus, h(λ,L) has a minimum at some positive value of L. Since ƒ(λ,L) is monotonically increasing (i.e., the derivative is positive), the minimum must occur where the derivative of g(λ,c+L)/L is negative. We have

${{\frac{\mathbb{d}}{\mathbb{d}L}{{g\left( {\lambda,{c + L}} \right)}/L}} = {{{{g^{\prime}\left( {\lambda,{c + L}} \right)}/L} - {{g\left( {\lambda,{c + L}} \right)}/L^{2}}} = {{{b_{2} \cdot \lambda^{b_{1}} \cdot {\left( {c + L} \right)^{b_{2} - 1}/L}} - {\lambda^{b_{1}} \cdot {\left( {c + L} \right)^{b_{2}}/L^{2}}}} = {\left( {\lambda^{b_{1}} \cdot {\left( {c + L} \right)^{b_{2} - 1}/L^{2}}} \right) \cdot \left( {{b_{2} \cdot L} - c - L} \right)}}}},$ which becomes positive when L≧c/(b₂−1)—i.e., the derivative is negative only when L=O(log λ). For L<(c−1)/(b₂−1), we have that the above derivative is less than −λ^(b) ¹ ·(c+L)^(b) ² ⁻¹/L², which dominates the positive derivative of f. Therefore, for large enough value of λ, the value h(λ,L) has its minimum at some Lε[(c−1)/(b₂−1), c/(b₂−1)].

This lemma basically says that, since homomorphic decryption takes θ(log λ) levels and its cost is super-linear and dominates that of normal homomorphic operations (FHE.Add and FHE.Mult), it makes sense to bootstrap lazily—in particular, once every θ(log λ) levels. (If one bootstrapped even more lazily than this, the super-linear cost of bootstrapping begins to ensure that the (amortized) per-gate cost of bootstrapping alone is increasing.) It is easy to see that, since the per-gate computation is dominated by bootstrapping, bootstrapping lazily every θ(log λ) levels reduces the per-gate computation by a factor of θ(log λ).

5.3 Batching the Bootstrapping Operation

Suppose that we are evaluating a circuit homomorphically, that we are currently at a level in the circuit that has at least d gates (where d is the dimension of our ring), and that we want to bootstrap (refresh) all of the ciphertexts corresponding to the respective wires at that level. That is, we want to homomorphically evaluate the decryption function at least d times in parallel. This seems like an ideal place to apply batching.

However, there are some nontrivial problems. In Section 5.1, our focus was rather limited. For example, we did not consider whether homomorphic operations could continue after the batched computation. Indeed, at first glance, it would appear that homomorphic operations cannot continue, since, after batching, the encrypted data is partitioned into non-interacting relatively-prime plaintext slots, whereas the whole point of homomorphic encryption is that the encrypted data can interact (within a common plaintext slot). Similarly, we did not consider homomorphic operations before the batched computation. Somehow, we need the input to the batched computation to come pre-partitioned into the different plaintext slots.

What we need are Pack and Unpack functions that allow the batching procedure to interface with “normal” homomorphic operations. One may think of the Pack and Unpack functions as an on-ramp to and an exit-ramp from the “fast lane” of batching. Let us say that normal homomorphic operations will always use the plaintext slot R_(P) ₁ . Roughly, the Pack function should take a bunch of ciphertexts c₁, . . . , c_(d) that encrypt messages m₁, . . . , m_(d)εZ_(p) under key s₁ for modulus q and plaintext slot R_(p) ₁ , and then aggregate them into a single ciphertext c under some possibly different key s₂ for modulus q, so that correctness holds with respect to all of the different plaintext slots—i.e. m_(i)=[[

c,s₂

]_(q)]_(p) _(i) for all i. The Pack function thus allows normal homomorphic operations to feed into the batch operation. The Unpack function should accept the output of a batched computation, namely a ciphertext c′ such that m_(i)=[[

c′,s_(1′)

]_(q)]_(p) _(i) for all i, and then de-aggregate this ciphertext by outputting ciphertexts c_(1′), . . . , c_(d′) under some possibly different common secret key s_(2′) such that m_(i)=[[

c_(i′),s_(2′)

]_(q)]_(p) ₁ for all i. Now that all of the ciphertexts are under a common key and plaintext slot, normal homomorphic operations can resume. With such Pack and Unpack functions, we could indeed batch the bootstrapping operation. For circuits of large width (say, at least d) we could reduce the per-gate bootstrapping computation by a factor of d, making it only quasi-linear in λ. Assuming the Pack and Unpack functions have complexity at most quasi-quadratic in d (per-gate this is only quasi-linear, since Pack and Unpack operate on d gates), the overall per-gate computation of a batched-bootstrapped scheme becomes only quasi-linear.

Here, we describe suitable Pack and Unpack functions. These functions will make heavy use of the automorphisms σ_(i→j) over R that map elements of p_(i) to elements of p_(j). (See Section 5.1.1.) We note that Smart and Vercauteren [23] used these automorphisms to construct something similar to our Pack function (though for unpacking they resorted to bootstrapping). We also note that Lyubashevsky, Peikert and Regev [15] used these automorphisms to permute the ideal factors q_(i) of the modulus q, which was an essential tool toward their proof of the pseudorandomness of RLWE.

Toward Pack and Unpack procedures, we begin with the observation that if m is encoded as a number in {0, . . . , p−1} and if m=[[

c,s

]_(q)]p_(i), then m=[[

σ_(i→j)(c),σ_(i→j)(s)

]_(q)]p_(j). That is, we can switch the plaintext slot but leave the decrypted message unchanged by applying the same automorphism to the ciphertext and the secret key. (These facts follow from the fact that σ_(i→j) is a homomorphism, that it maps elements of p_(i) to elements of p_(j), and that it fixes integers.) Of course, then we have a problem: the ciphertext is now under a different key, whereas we may want the ciphertext to be under the same key as other ciphertexts. To get the ciphertexts to be back under the same key, we simply use the SwitchKey algorithm to switch all of the ciphertexts to a new common key.

Some technical remarks before we describe Pack/Unpack more formally: We mention again that E.PublicKeyGen is modified in the obvious way so that A·s=p·e rather than 2·e, and that this modification induces a similar modification in SwitchKeyGen. Also, let uεR be a short element such that uε1+p₁ and uεp_(j) for all j≠1. It is obvious that such a u with coefficients in (−p/2, p/2] can be computed efficiently by first picking any element u′ such that u′ε1+p₁ and u′εp_(j) for all j≠1, and then reducing the coefficients of u′ modulo p.

PackSetup(s₁,s₂):

Takes as input two secret keys s₁,s₂. For all iε[1,d], it runs τ_(σ) _(1→i) _((s) ₁ _()→s) ₂ ←SwitchKeyGen(σ_(1→i)(s₁),s₂).

Pack({c_(i)}_(i=1) ^(d),{τ_(∝) _(1→i) _((s) ₁ _()→s) ₂ }_(i=1) ^(d)):

Takes as input ciphertexts c₁, . . . , c_(d) such that m_(i)=[[

c_(i),s₁

]_(q)]_(p) ₁ and 0=[[

c_(i),s₁

]_(q)]_(p) _(j) for all j≠1, and also some auxiliary information output by PackSetup. For all i, it does the following:

-   -   Computes c_(i)*←σ_(1→i)(c_(i)). (Observe: We have m_(i)=[[         c_(i)*,σ_(1→i)(s₁)         ]_(q)]_(p) _(i) while 0=[[         c₁*,σ_(1→i)(s₁)         ]_(q)]_(p) _(j) for all j≠i.)     -   Runs c_(i) ^(†)←SwitchKey(τ_(σ) _(1→i) _((s) ₁ _()→s) ₂ ,c_(i)*)         (Observe: Assuming the noise does not wrap, we have that         m_(i)=[[         c_(i) ^(†),s₂         ]_(q)]_(p) _(i) and 0=[[         c_(i) ^(\),s₂         ]_(q)]_(p) _(j) for all j≠i.)

Finally, it outputs c←Σ_(i=1) ^(d)c_(i) ^(\). (Observe: Assuming the noise does not wrap, we have m_(i)=[[

c,s₂

]_(q)]_(p) _(i) for all i.)

UnpackSetup(s₁,s₂):

Takes as input secret keys s₁,s₂. For all iε[1,d], it runs τ_(σ) _(i→1) _((s) ₁ _()→s) ₂ ←SwitchKeyGen(σ_(i→1)(s₁),s₂).

Unpack(c,{τ₉₄ _(i→1) _((s) ₁ _()→s) ₂ }_(i=1) ^(d)):

Takes as input a ciphertext c such that m_(i)=[[

c,s₁

]_(q)]_(p) _(i) for all i, and also some auxiliary information output by UnpackSetup. For all i, it does the following:

-   -   Computes c_(i)←u·σ_(i→1)(c). (Observe: Assuming the noise does         not wrap, m_(i)=[[         c_(i),σ_(i→1)(s₁)         ]_(q)]_(p) ₁ and 0=[[         c_(i),σ_(i→1)(s₁)         ]_(q)]_(p) _(j) for all j≠1.)     -   Outputs C_(i)*→SwitchKey(τ_(σ) _(i→1) _((s) ₁ _()→s) ₂ ,c_(i)).         (Observe: Assuming the noise does not wrap, m_(i)=[[         c_(i)*,s₂         ]_(q)]_(p) ₁ and 0=[[         c_(i)*,s₂         ]_(q)]_(p) _(j) for all j≠1.)

Splicing the Pack and Unpack procedures into our scheme FHE is tedious but pretty straightforward. Although these procedures introduce many more encrypted secret keys, this does not cause a circular security problem as long as the chain of encrypted secret keys is acyclic; then the standard hybrid argument applies. After applying Pack or Unpack, one may apply modulus reduction to reduce the noise back down to normal.

5.4 More Fun with Funky Plaintext Spaces

In some cases, it might be nice to have a plaintext space isomorphic to Z_(p) for some large prime p—e.g., one exponential in the security parameter. So far, we have been using R_(p) as our plaintext space, and (due to the rounding step in modulus switching) the size of the noise after modulus switching is proportional to p. When p is exponential, our previous approach for handling the noise (which keeps the magnitude of the noise polynomial in λ) obviously breaks down.

To get a plaintext space isomorphic to Z_(p) that works for exponential p, we need a new approach. Instead of using an integer modulus, we will use an ideal modulus I (an ideal of R) whose norm is some large prime p, but such that we have a basis B₁ of I that is very short—e.g. ∥B₁∥=O(poly(d)·p^(1/d)). Using an ideal plaintext space forces us to modify the modulus switching technique nontrivially.

Originally, when our plaintext space was R₂ each of the moduli in our “ladder” was odd—that is, they were all congruent to each other modulo 2 and relatively prime to 2. Similarly, we will have to choose each of the moduli in our new ladder so that they are all congruent to each other modulo I and relatively prime to I. (This just seems necessary to get the scaling to work, as the reader will see shortly.) This presents a difficulty, since we wanted the norm of I to be large—e.g., exponential in the security parameter. If we choose our moduli q_(j) to be integers, then we have that the integer q_(j+1)−q_(j)εI—in particular, q_(j+1)−q_(j) is a multiple of I's norm, implying that the q_(j)'s are exponential in the security parameter. Having such large q_(j)'s does not work well in our scheme, since the underlying lattice problems becomes easy when q_(j)/B is exponential in d where B is a bound of the noise distribution of fresh ciphertexts, and since we need B to remain quite small for our new noise management approach to work effectively. So, instead, our ladder of moduli will also consist of ideals—in particular, principle ideals (q_(j)) generated by an element of q_(j)εR. Specifically, it is easy to generate a ladder of q_(j)'s that are all congruent to 1 moduli I by sampling appropriately-sized elements q_(j) of the coset 1+I (using our short basis of I), and testing whether the principal ideal (q_(j)) generated by the element has appropriate norm.

Now, let us reconsider modulus switching in light of the fact that our moduli are now principal ideals. We need an analogue of Lemma 4 that works for ideal moduli.

Let us build up some notation and concepts that we will need in our new lemma. Let P_(q), be the half-open parallelepiped associated to the rotation basis of qεR. The rotation basis B_(q) of q is the d-dimensional basis formed by the coefficient vectors of the polynomials x^(i)q(x) mod ƒ(x) for iε[0,d−1]. The associated parallelepiped is P_(q)={Σx_(i)·b_(i):b_(i)εB_(q),z_(i)ε[−½,½)}. We need two concepts associated to this parallelepiped. First, we will still use the notation [a]₉, but where q is now an R-element rather than integer. This notation refers to a reduced modulo the rotation basis B_(q) of q—i.e., the element [a]_(q) such that [a]_(q)−aεqR and [a]_(q)εP_(q). Next, we need notions of the inner radius r_(q,in) and outer radius r_(q,out) of P_(q)—that is, the largest radius of a ball that is circumscribed by P_(q), and the smallest radius of a ball that circumscribes P_(q). It is possible to choose q so that the ratio r_(q,out)/r_(q,in) is poly(d). For example, this is true when q is an integer. More generally, if q is sampled uniformly from a ball of radius R with center T·e₁ for T?R, so that q's coefficient vector is “almost parallel” to e₁, one can show (for appropriate values of R and T) that r_(q,out)/r_(q,in) will be poly(d). Choosing q in such a manner, one can also ensure that ∥q⁻¹∥=1/∥q∥ up to a poly(d) factor. (∥q⁻¹∥ refers to the Euclidean norm of the coefficient vector of the inverse of q in the overlying field Q(x)/ƒ(x).) For convenience, let α(d) be a polynomial such that ∥q⁻¹∥=1/∥q∥ up to a α(d) factor and moreover r_(q,out)/r_(q,in) is at most α(d) with overwhelming probability. For such an α, we say q is α-good.

Of course, not every (not even a high proportion) of primes are the norm of a principal ideal generated by an α-good qεR. But there many such primes, and that may suffice for many applications.

Below, we will also use r_(B,out) to denote the outer radius associated to the parallelepiped determined by basis B.

Lemma 11

Let q₁ and q₂, ∥q₁∥<∥q₂∥, be two α-good elements of R. Let B_(I) be a short basis (with outer radius r_(B) _(I) _(,out)) of an ideal I of R such that q₁−q₂εI. Let c be an integer vector and c′←Scale(c,q₂,q₁,I)—that is, c′ is an R-element at most 2r_(B) _(I) _(,out) distant from (q₁/q₂)·c such that c′−cεI. Then, for any s with

${\left\lbrack \left\langle {c,s} \right\rangle \right\rbrack_{q_{2}}} < \frac{\left( {{r_{q_{2},{in}}/{\alpha(d)}^{2}} - {\left( {{q_{2}}/{q_{1}}} \right){\gamma_{R} \cdot 2}{r_{B_{I},{out}} \cdot {\ell_{1}^{(R)}(s)}}}} \right)}{\left( {{\alpha(d)} \cdot \gamma_{R}^{2}} \right)}$ we have [

c′,s

] _(q) ₁ =[

c,s

] _(q) ₂ mod I and ∥[

c′,s

] _(q) ₁ ∥<α(d)·γ_(R) ²·(∥q ₁ ∥/∥q ₂∥)·∥[

c,s

] _(q) ₂ ∥+γ_(R)·2r _(B) _(I) _(,out) ·l ₁ ^((R))(s) where l₁ ^((R))(s) is defined as Σ_(i)∥s[i]∥.

(Proof Omitted.)

With this extension of modulus switching, we can use plaintext spaces that are very large (exponential in the security parameter) and that have properties that are often desirable (such as being isomorphic to a large prime field).

5.5 Other Optimizations

If one is willing to assume circular security, the keys {s_(j)} may all be the same, thereby permitting a public key of size independent of L.

While it is not necessary, squashing may still be a useful optimization in practice, as it can be used to lower the depth of the decryption function, thereby reducing the size of the largest modulus needed in the scheme, which may improve efficiency.

6 Summary

Our RLWE-based FHE scheme without bootstrapping requires only Õ(λ·L³) per-gate computation where L is the depth of the circuit being evaluated, while the bootstrapped version has only Õ(2²) per-gate computation. For circuits of width Ω(λ), we can use batching to reduce the per-gate computation of the bootstrapped version by another factor of λ. In follow-on work, Gentry, Halevi and Smart [11] show that the per-gate overhead can be further reduced to polylogarithmic in the security parameter.

While these schemes should perform significantly better than previous FHE schemes, we caution that the polylogarithmic factors in the per-gate computation are large.

Another way to view exemplary embodiments of the invention is in terms of both a noise “ceiling” and a noise “floor.” The noise ceiling is the modulus. Recall that a ciphertext c mod p has vector coefficients on the range (−p/2,p/2). If the noise becomes too large relative to the modulus (the range for coefficients defined by the modulus) then decryption of the ciphertext may fail and correctness is lost. As previously noted, it is not just the ratio of the noise to the “noise ceiling” that is important—one must also consider the (absolute) magnitude of the noise, especially in multiplications. Since multiplications have the capacity to increase the magnitude of the noise exponentially or even at doubly exponential growth (e.g., x^(2L) for L multiplications/levels), the magnitude of the noise constrains the number of levels L that can be used. This is what leads the bootstrapping approach only to enable logarithmic depth circuits. Thus, the magnitude of the noise may be considered the noise floor—an upper bound on the size of the actual noise. The goal is to have the noise floor meet the noise ceiling as slowly as possible since this will allow for more levels and more operations to be performed homomorphically.

Using exemplary embodiments of the invention, the noise floor remains fixed and the noise ceiling is reduced by a fixed factor (p/q) due to modulus switching. To maintain control over the approach of the noise ceiling, it is preferable to perform the modulus switching (refresh function) at least after each multiplication. Since addition does not lead to noise growth as fast as multiplication does, it may be possible to perform more than one addition before refreshing the ciphertext with the modulus switching technique.

In comparison to BV [3], it is noted that BV uses dimension reduction which is a kind of “bundled” key-switching and modulus-switching step. The dimension reduction is applied once (once each epoch between bootstrappings) immediately before bootstrapping to get the ciphertext to be very small (which simplifies the bootstrapping process). BV lets the noise grow uncontrolled (e.g., exponentially) and uses dimension reduction at the end (right before bootstrapping). Since the noise is allowed to grow uncontrolled, big parameters must be used (the ciphertexts are vectors of big dimension with big coefficients).

Contrasting with BV, the exemplary embodiments of the invention use modulus switching as a separate step, not bundled with key switching per se. Since this step merely involves multiplying by a known fraction and rounding appropriately, it can be performed even without any public key material—in particular without the “evaluation key” that BV needs to perform dimension reduction. Furthermore, the exemplary embodiments of the invention use modulus switching iteratively as an aggressive noise management technique rather than only as a precursor to bootstrapping.

7. REFERENCES [1] B. Applebaum, D. Cash, C. Peikert, and A. Sahai. Fast cryptographic primitives and circular-secure encryption based on hard learning problems. In CRYPTO, volume 5677 of Lecture Notes in Computer Science, pages 595-618. Springer, 2009. [2] D. Boneh, E. -J. Goh, and K. Nissim. Evaluating 2-DNF formulas on ciphertexts. In Proceedings of Theory of Cryptography Conference 2005, volume 3378 of LNCS, pages 325-342, 2005. [3] Z. Brakerski and V. Vaikuntanathan. Efficient fully homomorphic encryption from (standard) lwe. Manuscript, to appear in FOCS 2011, available at http://eprint.iacr.org/2011/344. [4] Z. Brakerski and V. Vaikuntanathan. Fully homomorphic encryption from ring-lwe and security for key dependent messages. Manuscript, to appear in CRYPTO 2011. [5] J. -S. Coron, A. Mandal, D. Naccache, and M. Tibouchi. Fully-homomorphic encryption over the integers with shorter public-keys. Manuscript, to appear in Crypto 2011. [6] M. Dijk, C. Gentry, S. Halevi, and V. Vaikuntanathan. Fully homomorphic encryption over the integers. In Advances in Cryptology - EUROCRYPT′10, volume 6110 of Lecture Notes in Computer Science, pages 24-43. Springer, 2010. Full Version available on-line from http://eprint.iacr.org/2009/616. [7] C. Gentry. A fully homomorphic encryption scheme. PhD thesis, Stanford University, 2009. crypto.stanford.edu/craig. [8] C. Gentry. Fully homomorphic encryption using ideal lattices. In M. Mitzenmacher, editor, STOC, pages 169-178. ACM, 2009. [9] C. Gentry and S. Halevi. Fully homomorphic encryption without squashing using depth-3 arithmetic circuits. Manuscript, to appear in FOCS 2011, available at http://eprint.iacr.org/2011/279. [10] C. Gentry and S. Halevi. Implementing gentry's fully-homomorphic encryption scheme. In EUROCRYPT, volume 6632 of Lecture Notes in Computer Science, pages 129-148. Springer, 2011. [11] C. Gentry, S. Halevi, and N. P. Smart. Fully homomorphic encryption with polylog overhead. Manuscript at http://eprint.iacr.org/2011/566, 2011. [12] S. Halevi, 2011. Personal communication. [13] Y. Ishai and A. Paskin. Evaluating branching programs on encrypted data. In S. P. Vadhan, editor, TCC, volume 4392 of Lecture Notes in Computer Science, pages 575-594. Springer, 2007. [14] K. Lauter, M. Naehrig, and V. Vaikuntanathan. Can homomorphic encryption be practical? Manuscript at http://eprint.iacr.org/2011/405, 2011. [15] V. Lyubashevsky, C. Peikert, and O. Regev. On ideal lattices and learning with errors over rings. In EUROCRYPT, volume 6110 of Lecture Notes in Computer Science, pages 1-23, 2010. [16] C. A. Melchor, P. Gaborit, and J. Herranz. Additively homomorphic encryption with -operand multiplications. In T. Rabin, editor, CRYPTO, volume 6223 of Lecture Notes in Computer Science, pages 138-154, Springer, 2010. [17] D. Micciancio. Generalized compact knapsacks, cyclic lattices, and efficient one-way functions. Computational Complexity, 16(4): 365-411, December 2007. Preliminary version in FOCS 2002. [18] C. Peikert. Public-key cryptosystems from the worst-case shortest vector problem: extended abstract. In STOC, pages 333-342. ACM, 2009. [19] O. Regev. On lattices, learning with errors, random linear codes, and cryptography. In H. N. Gabow and R. Fagin, editors, STOC, pages 84-93. ACM, 2005. [20] O. Regev. The learning with errors problem (invited survey). In IEEE Conference on Computational Complexity, pages 191-204. IEEE Computer Society, 2010. [21] R. Rivest, L. Adleman, and M. L. Dertouzos. On data banks and privacy homomorphisms. In Foundations of Secure Computation, pages 169-180, 1978. [22] N. P. Smart and F. Vercauteren. Fully homomorphic encryption with relatively small key and ciphertext sizes. In Public Key Cryptography - PKC′10, volume 6056 of Lecture Notes in Computer Science, pages 420-443. Springer, 2010. [23] N. P. Smart and F. Vercauteren. Fully homomorphic SIMD operations. Manuscript at http://eprint.iacr.org/2011/133, 2011. [24] D. Stehlé and R. Steinfeld. Faster fully homomorphic encryption. In ASIACRYPT, volume 6477 of Lecture Notes in Computer Science, pages 377-394. Springer, 2010.

8. Apparatus and Computer Programs

FIG. 1 illustrates a block diagram of an exemplary system in which various exemplary embodiments of the invention may be implemented. The system 100 may include at least one circuitry 102 (e.g., circuitry element, circuitry components, integrated circuit) that may in certain exemplary embodiments include at least one processor 104. The system 100 may also include at least one memory 106 (e.g., a volatile memory device, a non-volatile memory device), and/or at least one storage 108. The storage 108 may include a non-volatile memory device (e.g., EEPROM, ROM, PROM, RAM, DRAM, SRAM, flash, firmware, programmable logic, etc.), magnetic disk drive, optical disk drive and/or tape drive, as non-limiting examples. The storage 108 may comprise an internal storage device, an attached storage device and/or a network accessible storage device, as non-limiting examples. The system 100 may include at least one program logic 110 including code 112 (e.g., program code) that may be loaded into the memory 106 and executed by the processor 104 and/or circuitry 102. In certain exemplary embodiments, the program logic 110, including code 112, may be stored in the storage 108. In certain other exemplary embodiments, the program logic 110 may be implemented in the circuitry 102. Therefore, while FIG. 1 shows the program logic 110 separately from the other elements, the program logic 110 may be implemented in the memory 106 and/or the circuitry 102, as non-limiting examples.

The system 100 may include at least one communications component 114 that enables communication with at least one other component, system, device and/or apparatus. As non-limiting examples, the communications component 114 may include a transceiver configured to send and receive information, a transmitter configured to send information and/or a receiver configured to receive information. As a non-limiting example, the communications component 114 may comprise a modem or network card. The system 100 of FIG. 1 may be embodied in a computer or computer system, such as a desktop computer, a portable computer or a server, as non-limiting examples. The components of the system 100 shown in FIG. 1 may be connected or coupled together using one or more internal buses, connections, wires and/or (printed) circuit boards, as non-limiting examples.

It should be noted that in accordance with the exemplary embodiments of the invention, one or more of the circuitry 102, processor(s) 104, memory 106, storage 108, program logic 110 and/or communications component 114 may store one or more of the various items (e.g., public/private key(s), ciphertexts, encrypted items, matrices, variables, equations, formula, operations, operational logic, logic) discussed herein. As a non-limiting example, one or more of the above-identified components may receive and/or store the information (e.g., to be encrypted, resulting from decryption) and/or the ciphertext (e.g., to be decrypted, to be operated on homomorphically, resulting from encryption). As a further non-limiting example, one or more of the above-identified components may receive and/or store the encryption function(s) and/or the decryption function(s), as described herein.

The exemplary embodiments of this invention may be carried out by computer software implemented by the processor 104 or by hardware, or by a combination of hardware and software. As a non-limiting example, the exemplary embodiments of this invention may be implemented by one or more integrated circuits. The memory 106 may be of any type appropriate to the technical environment and may be implemented using any appropriate data storage technology, such as optical memory devices, magnetic memory devices, semiconductor-based memory devices, fixed memory and removable memory, as non-limiting examples. The processor 104 may be of any type appropriate to the technical environment, and may encompass one or more of microprocessors, general purpose computers, special purpose computers and processors based on a multi-core architecture, as non-limiting examples.

9. Further Exemplary Embodiments

There is one further extension of the exemplary embodiments of the invention that merits brief consideration. As described above, in some exemplary embodiments the noise (i.e., the magnitude of the noise) remains (relatively, substantially) constant and low level while the modulus decreases. Thus, the gap between the modulus (e.g., ceiling) and the noise (e.g., floor) eventually runs out. As noted above, and as non-limiting examples, one can set the initial modulus big enough to enable the polynomial depth circuit to be executed by the time the gap runs out or one can utilize bootstrapping once the gap runs out. Also as noted above, compared with the bootstrapping approach's logarithmic depth for circuits, the exemplary embodiments of the invention enable a polynomial depth circuit of multiplications starting with parameters that are only polynomial in size.

Another way to look at exemplary embodiments of the invention, that is instead of keeping the floor (magnitude of noise) constant and slowly reducing the ceiling (modulus), is by normalizing to the modulus. For example, set the modulus to 1 and utilize fractions (i.e., values less than 1). In this case, the noise is represented as a fractional part of the coefficients and proceeds to increase until it approaches the modulus (which has a value of 1). While this would appear to maintain the ceiling as a constant and slowly increase the floor until one runs out of room, it is simply an extension of the above-described concepts with the coefficients normalized to the modulus (as evident by the modulus having a value of 1). In particular, note that the growth of the noise would be similar, if not identical, in speed and/or value to the described reduction of the modulus. In both cases, it is a technique that involves maintaining one of the floor or ceiling at a (relatively, substantially) constant value and having the other one approach (e.g., decrease for the ceiling or increase for the floor) by utilizing what amounts to different moduli. Modulus switching essentially suggests that a particular value of the modulus is irrelevant since it is the magnitude of the noise that may be accounted for instead (e.g., as opposed to the ratio of noise to modulus).

Below are further descriptions of various non-limiting, exemplary embodiments of the invention. The below-described exemplary embodiments are numbered separately for clarity purposes. This numbering should not be construed as entirely separating the various exemplary embodiments since aspects of one or more exemplary embodiments may be practiced in conjunction with one or more other aspects or exemplary embodiments.

(1) In one exemplary embodiment of the invention, and as shown in FIG. 3, a method comprising: receiving (301) a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; performing (302) at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and reducing (303) a noise level of the third ciphertext by using the refresh function.

A method as above, where application of the refresh function to a ciphertext prevents growth of the magnitude of noise for the ciphertext while reducing a range of coefficients for the ciphertext. A method as in any above, where the at least one operation function comprises at least one of an addition and a multiplication, where in response to a multiplication being performed the refresh function is applied. A method as in any above, where the at least one operation function comprises at least one of an addition and a multiplication, where in response to a multiplication being desired the refresh function is applied to at least one input of the multiplication. A method as in any above, where the encryption scheme enables evaluation of a polynomial depth circuit of multiplications.

A method as in any above, where the encryption scheme enables evaluation of a polynomial depth circuit of multiplications without (performing, utilizing) bootstrapping. A method as in any above, further comprising: batching evaluations of a plurality of ciphertexts across a same circuit. A method as in any above, further comprising: performing bootstrapping (e.g., for optimization). A method as in any above, further comprising: performing batching of evaluations of a plurality of ciphertexts across a same circuit while also performing bootstrapping. A method as in any above, further comprising: performing squashing.

A computer program comprising machine readable instructions which when executed by an apparatus control it to perform the method as in any one of the preceding claims. A method as in any above, implemented as a computer program. A method as in any above, implemented as a program of instructions stored (e.g., tangibly embodied) on a program storage device (e.g., at least one memory, at least one computer-readable medium) and executable by a computer (e.g., at least one processor). A method as in any above, further comprising one or more aspects of the exemplary embodiments of the invention as described further herein.

(2) In another exemplary embodiment of the invention, and as shown in FIG. 3, a computer-readable storage medium storing program instructions, execution of the program instructions resulting in operations comprising: receiving (301) a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; performing (302) at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and reducing (303) a noise level of the third ciphertext by using the refresh function.

A computer readable medium as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

(3) In a further exemplary embodiment of the invention, an apparatus comprising: at least one processor configured to receive a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; and at least one memory configured to store the first ciphertext and the second ciphertext, where the at least one processor is further configured to perform at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and to reduce a noise level of the third ciphertext by using the refresh function.

An apparatus as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

(4) In another exemplary embodiment of the invention, an apparatus comprising: means for receiving (e.g., at least one input, at least one processor, at least one receiver) a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; means for performing (e.g., at least one processor, at least one circuit, at least one function, at least one logic circuit, at least one integrated circuit) at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and means for reducing (e.g., at least one processor, at least one circuit, at least one function, at least one logic circuit, at least one integrated circuit) a noise level of the third ciphertext by using the refresh function.

An apparatus as in any above, further comprising means for storing at least one ciphertext (e.g., the first ciphertext, the second ciphertext, and/or the third ciphertext). An apparatus as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

(5) In a further exemplary embodiment of the invention, an apparatus comprising: reception circuitry configured to receive a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; operation circuitry configured to perform at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and noise reduction circuitry (e.g., control circuitry) configured to reduce a noise level of the third ciphertext by using the refresh function.

An apparatus as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

(6) In another exemplary embodiment of the invention, an apparatus comprising: a first system configured to receive a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of the magnitude of noise for a ciphertext while reducing the modulus of the ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming a first ciphertext c modulo q into a second ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; a second system configured to perform at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and a third system configured to reduce a noise level of the third ciphertext by using the refresh function.

An apparatus as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

(7) In one exemplary embodiment of the invention, and as shown in FIG. 4, a method comprising: receiving (401) a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to enable slow growth of the magnitude of noise for a ciphertext while maintaining the modulus of the ciphertext constant without using the secret key, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; performing (402) at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and reducing (403) a noise level of the third ciphertext by using the refresh function.

A method as above, where the modulus of the ciphertext is maintained at a value of 1. A method as in any above, where the magnitude of noise for the ciphertext is represented as a fractional part of coefficients for the ciphertext. A method as in any above, where the at least one operation function comprises at least one of an addition and a multiplication, where in response to a multiplication being performed the refresh function is applied. A method as in any above, where the encryption scheme enables evaluation of a polynomial depth circuit of multiplications.

A computer program comprising machine readable instructions which when executed by an apparatus control it to perform the method as in any one of the preceding claims. A method as in any above, implemented as a computer program. A method as in any above, implemented as a program of instructions stored (e.g., tangibly embodied) on a program storage device (e.g., at least one memory, at least one computer-readable medium) and executable by a computer (e.g., at least one processor). A method as in any above, further comprising one or more aspects of the exemplary embodiments of the invention as described further herein.

(8) In another exemplary embodiment of the invention, and as shown in FIG. 4, a computer-readable storage medium storing program instructions, execution of the program instructions resulting in operations comprising: receiving (401) a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to enable slow growth of the magnitude of noise for a ciphertext while maintaining the modulus of the ciphertext constant without using the secret key, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; performing (402) at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and reducing (403) a noise level of the third ciphertext by using the refresh function.

A computer readable medium as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

(9) In a further exemplary embodiment of the invention, an apparatus comprising: at least one processor configured to receive a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to enable slow growth of the magnitude of noise for a ciphertext while maintaining the modulus of the ciphertext constant without using the secret key, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; and at least one memory configured to store the first ciphertext and the second ciphertext, where the at least one processor is further configured to perform at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and to reduce a noise level of the third ciphertext by using the refresh function.

An apparatus as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

(10) In another exemplary embodiment of the invention, an apparatus comprising: means for receiving (e.g., at least one input, at least one processor, at least one receiver) a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to enable slow growth of the magnitude of noise for a ciphertext while maintaining the modulus of the ciphertext constant without using the secret key, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; means for performing (e.g., at least one processor, at least one circuit, at least one function, at least one logic circuit, at least one integrated circuit) at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and means for reducing (e.g., at least one processor, at least one circuit, at least one function, at least one logic circuit, at least one integrated circuit) a noise level of the third ciphertext by using the refresh function.

An apparatus as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

(11) In a further exemplary embodiment of the invention, an apparatus comprising: reception circuitry configured to receive a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to enable slow growth of the magnitude of noise for a ciphertext while maintaining the modulus of the ciphertext constant without using the secret key, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; operation circuitry configured to perform at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and noise reduction circuitry (e.g., control circuitry) configured to reduce a noise level of the third ciphertext by using the refresh function.

An apparatus as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

(12) In another exemplary embodiment of the invention, an apparatus comprising: a first system configured to receive a first ciphertext and a second ciphertext, where the first ciphertext comprises first data encrypted in accordance with an encryption scheme and the second ciphertext comprises second data encrypted in accordance with the encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two ciphertexts and uses the public key to perform at least one operation on the at least two ciphertexts and obtain a resulting ciphertext, where the refresh function operates to enable slow growth of the magnitude of noise for a ciphertext while maintaining the modulus of the ciphertext constant without using the secret key, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; a second system configured to perform at least one operation on the first ciphertext and the second ciphertext, using the at least one operation function, to obtain a third ciphertext; and a third system configured to reduce a noise level of the third ciphertext by using the refresh function.

An apparatus as in any above, further comprising one or more additional aspects of the exemplary embodiments of the invention as described herein.

10. Additional Points

The exemplary embodiments of the invention, as discussed herein and as particularly described with respect to exemplary methods, may be implemented in conjunction with a program storage device (e.g., at least one memory, computer-readable memory, computer-readable medium, computer-readable storage medium, computer-readable storage device, non-transitory computer-readable medium such as memory, flash memory, magnetic memory devices, EEPROM, ROM, PROM, RAM, DRAM, SRAM, firmware, programmable logic, etc.) readable by a machine (e.g., a device, apparatus, at least one processor), tangibly embodying a program of instructions (e.g., program, computer program) executable (e.g., by the machine) for performing operations. The operations comprise steps of utilizing the exemplary embodiments or steps of the method.

The blocks shown in FIGS. 3 and 4 further may be considered to correspond to one or more functions and/or operations that are performed by one or more components, circuits, chips, apparatus, processors, computer programs and/or function blocks. Any and/or all of the above may be implemented in any practicable solution or arrangement that enables operation in accordance with the exemplary embodiments of the invention as described herein.

In addition, the arrangement of the blocks depicted in FIGS. 3 and 4 should be considered merely exemplary and non-limiting. It should be appreciated that the blocks shown in FIGS. 3 and 4 may correspond to one or more functions and/or operations that may be performed in any order (e.g., any suitable, practicable and/or feasible order) and/or concurrently (e.g., as suitable, practicable and/or feasible) so as to implement one or more of the exemplary embodiments of the invention. In addition, one or more additional functions, operations and/or steps may be utilized in conjunction with those shown in FIGS. 3 and 4 so as to implement one or more further exemplary embodiments of the invention.

That is, the exemplary embodiments of the invention shown in FIGS. 3 and 4 may be utilized, implemented or practiced in conjunction with one or more further aspects in any combination (e.g., any combination that is suitable, practicable and/or feasible) and are not limited only to the steps, blocks, operations and/or functions shown in FIGS. 3 and 4.

Still further, the various names and/or symbols used for the parameters and/or functions are not intended to be limiting in any respect, as these parameters and functions may be identified by any suitable name and/or symbol.

Any use of the terms “connected,” “coupled” or variants thereof should be interpreted to indicate any such connection or coupling, direct or indirect, between the identified elements. As a non-limiting example, one or more intermediate elements may be present between the “coupled” elements. The connection or coupling between the identified elements may be, as non-limiting examples, physical, electrical, magnetic, logical or any suitable combination thereof in accordance with the described exemplary embodiments. As non-limiting examples, the connection or coupling may comprise one or more printed electrical connections, wires, cables, mediums or any suitable combination thereof.

Generally, various exemplary embodiments of the invention can be implemented in different mediums, such as software, hardware, logic, special purpose circuits or any combination thereof. As a non-limiting example, some aspects may be implemented in software which may be run on a computing device, while other aspects may be implemented in hardware.

The foregoing description has provided by way of exemplary and non-limiting examples a full and informative description of the best method and apparatus presently contemplated by the inventors for carrying out the invention. However, various modifications and adaptations may become apparent to those skilled in the relevant arts in view of the foregoing description, when read in conjunction with the accompanying drawings and the appended claims. However, all such and similar modifications will still fall within the scope of the teachings of the exemplary embodiments of the invention.

Furthermore, some of the features of the preferred embodiments of this invention could be used to advantage without the corresponding use of other features. As such, the foregoing description should be considered as merely illustrative of the principles of the invention, and not in limitation thereof. 

What is claimed is:
 1. A non-transitory computer-readable storage medium storing program instructions, execution of the program instructions resulting in operations comprising: transmitting by a requestor a query to a computer system; receiving at the computer system the query; accessing, at the computer system and from a memory of the computer system, a plurality of ciphertexts, where each of the input ciphertexts comprises data encrypted in accordance with an encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two given ciphertexts and uses the public key to perform at least one operation on the at least two given ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of a magnitude of noise for a provided ciphertext while reducing a modulus of the provided ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming the provided ciphertext c modulo q into another ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; retrieving, by the computer system and from the memory, one or more results corresponding to and satisfying the query by performing homomorphic operations using at least the plurality of input ciphertexts at least by: performing operations on ciphertexts according to a circuit that corresponds to the query and the evaluation of which produces the one or more results that satisfy the query, wherein the operations use the at least one operation function to obtain a ciphertext result, and wherein at least some of the operations involve the plurality of input ciphertexts; reducing a noise level of the ciphertext result by using the refresh function; and determining the one or more results of the evaluation of the circuit at least by evaluating the circuit and iterating the performing the operations and the reducing the noise level multiple times during the evaluation of the circuit; sending by the computer system the one or more results of the evaluation of the circuit to the requestor; and receiving by the requestor the one or more results and decrypting by the requestor the one or more results to determine an answer to the query.
 2. The computer-readable storage medium of claim 1, where application of the refresh function to the provided ciphertext also reduces a range of coefficients for an output of the refresh function, relative to a range of coefficients for the provided ciphertext.
 3. The computer-readable storage medium of claim 1, where the at least one operation comprises at least one of an addition and a multiplication, where in response to a multiplication being performed the refresh function is applied to an output of the multiplication.
 4. The computer-readable storage medium of claim 1, where the at least one operation comprises at least one of an addition and a multiplication, where in response to a multiplication being desired the refresh function is applied to at least one input of the multiplication.
 5. The computer-readable storage medium of claim 1, where the encryption scheme enables evaluation of a polynomial depth circuit of multiplications and wherein the circuit that corresponds to the query comprises the polynominal depth circuit of multiplications.
 6. A method, comprising: transmitting by a requestor a query to a computer system; receiving at a computer system the query; accessing, at the computer system and from a memory of the computer system, a plurality of input ciphertexts, where each of the input ciphertexts comprises data encrypted in accordance with an encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two given ciphertexts and uses the public key to perform at least one operation on the at least two given ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of a magnitude of noise for a provided ciphertext while reducing a modulus of the provided ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming the provided ciphertext c modulo q into a another ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; retrieving, by the computer system and from the memory, one or more results corresponding to and satisfying the query by performing homomorphic operations using at least the plurality of input ciphertexts at least by: performing operations on ciphertexts according to a circuit that corresponds to the query and the evaluation of which produces the one or more results that satisfy the query, wherein the operations use the at least one operation function to obtain ciphertext result, and wherein at least some of the operations involve the plurality of input ciphertexts; reducing a noise level of the ciphertext result by using the refresh function; and determining the one or more results of the evaluation of the circuit at least by evaluating the circuit and iterating the performing the operations and the reducing the noise level multiple times during the evaluation of the circuit; sending by the computer system the one or more results of the evaluation of the circuit to the requestor; and receiving by the requestor the one or more results and decrypting by the requestor the one or more results to determine an answer to the query.
 7. The method of claim 6, where application of the refresh function to the provided ciphertext also reduces a range of coefficients for an output of the refresh function, relative to a range of coefficients for the provided for the provided ciphertext.
 8. The method of claim 6, where the at least one operation comprises at least one of an addition and a multiplication, where in response to a multiplication being performed the refresh function is applied to an output of the multiplication.
 9. The method of claim 6, where the at least one operation comprises at least one of an addition and a multiplication, where in response to a multiplication being desired the refresh function is applied to at least one input of the multiplication.
 10. The method of claim 6, where the encryption scheme enables evaluation of a polynomial depth circuit of multiplications and wherein the circuit that corresponds to the query comprises the polynomial depth circuit of multiplications.
 11. An apparatus, comprising: a requestor comprising a first computer system configured to transmit a query to a second computer system; the second computer system configured to perform the following: receive a query from a requestor; access a plurality of input ciphertexts from a memory of the second computer system, the memory configured to store the input ciphertexts, where each of the input ciphertexts comprises data encrypted in accordance with an encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two given ciphertexts and uses the public key to perform at least one operation on the at least two given ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of a magnitude of noise for a provided ciphertext while reducing a modulus of the provided ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming the provided ciphertext c modulo q into another ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; where the second computer system is further configured to retrieve from the memory one or more results corresponding to and satisfying the query by perform homomorphic operations using at least the plurality of input ciphertexts at least by: performing operations on ciphertexts according to a circuit that corresponds to the query and the evaluation of which produces the one or more results that satisfy the query, wherein the operations use the at least one operation function to obtain a ciphertext result, and wherein the at least some of the operations involve the plurality of input ciphertexts; and reducing a noise level of the ciphertext result by using the refresh function; and determining the one or more results of the evaluation of the circuit at least by evaluating the circuit and iterating the performing the operations and the reducing the noise level multiple times during the evaluation of the circuit; where the second computer system is further configured to send the one or more results of the evaluation of the circuit to the requestor; where the requestor is configured to receive the one or more results and to decrypt the one or more results to determine an answer to the query.
 12. The apparatus of claim 11, where application of the refresh function to the provided ciphertext also reduces a range of coefficients for an output of the refresh function, relative to a range of coefficients for the provided ciphertext.
 13. The apparatus of claim 11, where the at least one operation comprises at least one of an addition and a multiplication, where in response to a multiplication being performed the refresh function is applied to an output of the multiplication.
 14. The apparatus of claim 11, where the at least one operation comprises at least one of an addition and a multiplication, where in response to a multiplication being desired the refresh function is applied to at least one input of the multiplication.
 15. The apparatus of claim 11, where the encryption scheme enables evaluation of a polynomial depth circuit of multiplications and wherein the circuit that corresponds to the query comprises the polynomial depth circuit of multiplications.
 16. An apparatus, comprising: means for transmitting by requestor a query to a computer system; means for receiving at the computer system the query; means for accessing, at the computer system and from a memory of the computer system, a plurality of input ciphertexts, where each of the plurality of ciphertexts comprises data encrypted in accordance with an encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two given ciphertexts and uses the public key to perform at least one operation on the at least two given ciphertexts and obtain a resulting ciphertext, where the refresh function operates to prevent growth of a magnitude of noise for a provided ciphertext while reducing a modulus of the provided ciphertext without using the secret key, where the refresh function utilizes a modulus switching technique that comprises transforming the provided ciphertext c modulo q into another ciphertext c′ modulo p while preserving correctness, where the modulus switching technique includes scaling by p/q and rounding, where p<q, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; means for retrieving, by the computer system and from the memory, one or more results corresponding to and satisfying the query by performing homomorphic operations using at least the plurality of ciphertexts at least using: means for performing operations on the ciphertexts according to a circuit that corresponds to the query and the evaluation of which produces the one or more results that satisfy the query, wherein the operations use the at least one operation function to obtain a ciphertext result, and wherein at least some of the operations involve the plurality of input ciphertexts; means for reducing a noise level of the ciphertext result by using the refresh function; and means for determining the one or more results of the evaluation of the circuit at least by evaluating the circuit and iterating the performing the operations and the reducing the noise level multiple times during the evaluation of the circuit; means for sending by the computer system the one or more results of the evaluation of the circuit to the requestor; and means for receiving by the requestor the one or more results and means for decrypting by the requestor the one or more results to determine an answer to the query.
 17. The apparatus of claim 16, where application of the refresh function to the provided ciphertext also reduces a range of coefficients for an output of the refresh function, relative to a range of coefficients for the provided ciphertext.
 18. The apparatus of claim 16, where the at least one operation comprises at least one of an addition and a multiplication, where in response to a multiplication being performed the refresh function is applied to an output of the multiplication.
 19. The apparatus of claim 16, where the at least one operation comprises at least one of an addition and a multiplication, where in response to a multiplication being desired the refresh function is applied to at least one input of the multiplication.
 20. The apparatus of claim 16, where the encryption scheme enables evaluation of a polynomial depth circuit of multiplications and wherein the circuit that corresponds to the query comprises the polynomial depth circuit of multiplications.
 21. A method, comprising: transmitting by a requestor a query to a computer system; receiving at the computer system the query; accessing, at the computer system and from a memory of the computer system, a plurality of input ciphertexts, where each of the plurality of ciphertexts comprises data encrypted in accordance with an encryption scheme, where the encryption scheme uses a public key and a secret key and includes an encryption function, a decryption function, at least one operation function and a refresh function, where the encryption function operates to obtain ciphertext by encrypting data using the public key, where the decryption function operates using the secret key to decrypt ciphertext for data encrypted using the public key and obtain the data, where the at least one operation function receives at least two given ciphertexts and uses the public key to perform at least one operation on the at least two given ciphertexts and obtain a resulting ciphertext, where the refresh function operates to enable slow growth of a magnitude of noise for a provided ciphertext while maintaining a modulus of the provided ciphertext constant without using the secret key, where the encryption scheme enables homomorphic operations to be performed on ciphertexts encoded and operated on in accordance with the encryption scheme; retrieving, by the computer system and from the memory, one or more results corresponding to and satisfying the query by homomorphic operations using at least the plurality of input ciphertexts at least by: performing operations on ciphertexts according to a circuit that corresponds to the query and the evaluation of which produces the one or more results that satisfy the query, wherein the operations use the at least one operation function to obtain a ciphertext result, and wherein at least some of the operations involve the plurality of input ciphertexts; reducing a noise level of the ciphertext result by using the refresh function; and determining the one or more results of evaluation of the circuit at least by evaluating the circuit and iterating the performing the operations and the reducing the noise level multiple times during the evaluation of the circuit; sending by the computer system the one or more results of the evaluation of the circuit to the requestor; and receiving by the requestor the one or more results and decrypting by the requestor the one or more results to determine an answer to the query.
 22. The method of claim 21, where the modulus of the provided ciphertext is maintained at a value of
 1. 23. The method of claim 21, where the magnitude of noise for the provided ciphertext is represented as a fractional part of coefficients for the ciphertext.
 24. The method of claim 21, where the at least one operation comprises at least one of an addition and a multiplication, where in response to a multiplication being performed the refresh function is applied to an output of the multiplication.
 25. The method of claim 21, where the encryption scheme enables evaluation of a polynomial depth circuit of multiplications and wherein the circuit that corresponds to the query comprises the polynomial depth circuit of multiplications. 